Preface



This volume^ aims at introducing some basic ideas for studying approxima- 
tion processes and, more generally, discrete processes. The study of discrete 
processes, which has grown together with the study of infinitesimal calcu- 
lus, has become more and more relevant with the use of computers. The 
volume is suitably divided in two parts. 

In the first part we illustrate the numerical systems of reals, of integers
as a subset of the reals, and of complex numbers. In this context we intro- 
duce, in Chapter 2, the notion of sequence which invites also a rethinking 
of the notions of limit and continuity^ in terms of discrete processes; then, 
in Chapter 3, we discuss some elements of combinatorial calculus and the 
mathematical notion of infinity. In Chapter 4 we introduce complex num- 
bers and illustrate some of their applications to elementary geometry; in 
Chapter 5 we prove the fundamental theorem of algebra and present some 
of the elementary properties of polynomials and rational functions, and of 
finite sums of harmonic motions. 

In the second part we deal with discrete processes, first with the process 
of infinite summation, in the numerical case, i.e., in the case of numerical 
series in Chapter 6, and in the case of power series in Chapter 7. The last 
chapter provides an introduction to discrete dynamical systems; it should
be regarded as an invitation to further study. 

We have tried to keep the treatment of topics as independent as pos- 
sible even at the cost of some repetition; usually, we assume as known 
the content of [GM1], but, whenever possible, we provide an alternative
elementary treatment in order to allow the use of part of this volume on 
sequences and series, independently from infinitesimal calculus. 

The main body is formed by Chapter 1, Sections 2 and 3, Chapter 2, 
Sections 1, 2, 3, and 4, Chapter 4, Sections 1 and 2, Chapter 6, Sections 1, 2, 
3, and 4 and Chapter 7, Sections 1 and 2 for about a third of the whole. The 
rest of the material may appear as heterogeneous; it develops in branches 
that eventually meet, from which it is easy to select several paths. However, 



^ This volume is a translation and revised edition of M. Giaquinta, G. Modica, Analisi 
Matematica, II, Approssimazione e processi discreti, Pitagora Editrice, Bologna, 
1999. 

^ We have discussed these notions in M. Giaquinta, G. Modica, Mathematical Analysis.
Functions of One Variable, Birkhäuser, Boston, 2003. In this volume we shall refer
to this work as [GM1].



we believe that the whole of the material is, besides its intrinsic interest, 
fundamentally basic for any further study of mathematical analysis. 

As in [GM1] an appropriate number of exercises are distributed in the
text and at the end of each chapter. They are marked by the symbol ¶;
the double ¶¶ indicates exercises that are more difficult.

We are greatly indebted to Cecilia Conti for her help in polishing our 
first draft and we warmly thank her. We would like to thank also Alessan- 
dro Berarducci, Roberto Conti, Pietro Majer and Stefano Marmi for their 
comments when preparing the Italian edition, and Stefan Hildebrandt for 
his comments and suggestions concerning especially the choice of illustra- 
tions. Our special thanks go also to all members of the editorial technical 
staff of Birkhäuser for the excellent quality of their work and especially to
the executive editor Ann Kostant. 

Note: We have tried to avoid misprints and errors. But, as most authors, 
we are imperfect authors. We will be very grateful to anybody who wants 
to inform us about errors or just misprints or wants to express criticism 
or other comments. Our e-mail addresses are 

giaquinta@sns.it modica@dma.unifi.it

We shall try to keep up an errata corrige at the following webpage:
http://www.sns.it/~giaquinta



Mariano Giaquinta
Giuseppe Modica
Pisa and Firenze 
October 2003 
1. Real Numbers and Natural Numbers



In this chapter, after an introductory section, in Section 1.2 we shall il- 
lustrate the axiomatic approach to real numbers, and, in Section 1.3, we 
shall identify the natural numbers as the smallest inductive subset of R.
Further information about natural numbers will be discussed in Chapter 3,
while the notions of sequences and of limit of a sequence, which are
specially relevant in mathematics, are discussed in Chapter 2; in Section 2.2
we present, in particular, several equivalent formulations of the continuity 
axiom. 



1.1 Introduction 

Rudiments of mathematics, or even refined geometrical and algebraic rules,
appear in many ancient civilizations, as for instance the Babylonian, the
Egyptian, the Hindu, the Chinese or some of the pre-Columbian civilizations.
But mathematics as an organized, independent and reasoned discipline,
that is as a science, developed from 600 to 300BC in Greece, thanks
probably to the democratic political system of the Greeks that must have
encouraged the attitude toward arguing.

Thales of Miletus (624BC-546BC) is given credit for inventing the 
mathematical proof and, according to Diadochus Proclus (411-485), 
Pythagoras of Samos (580BC-520BC) 

changed the study of geometry into the form of a liberal educa- 
tion, for he examined the principles to the bottom, and investi- 
gated its theorems in an immaterial and intellectual manner. 

Most of our sources are, however, of several centuries later and refer to 
Thales and Pythagoras in a legendary and mythological way. For example 
Aristotle, reporting on the mystic-religious society of Pythagoreans, says: 

the so-called Pythagoreans applied themselves to the study of 
mathematics in so much that, having been brought up in 
it, they thought that its principles must be the principles of all 
existing things. . . . They thought they found in numbers more 
than in fire, earth, or water, many resemblances to things which 
Figure 1.1. Thales’s theorem.



are and become .... Since then, all other things seemed in their
whole nature to be assimilated to numbers, while numbers seemed
to be the first things in the whole of nature, they supposed the
elements of numbers to be the elements of all things, and the
whole heaven to be a musical scale and a number.^



a. Numbers and measurement 

It seems therefore that the Pythagoreans believed all bodies to be made
up of a great number of corpuscles that were all identical and harmoniously
arranged. They identified integer numbers with patterns of those
atoms, thus making integers the basis of measure. Geometrical entities,
such as lines, surfaces and solids existed, as any other aspect of reality,
as aggregations of point-numbers. This is probably why they came to the
conclusion that the relations of capacity between two homogeneous quantities
could always be evaluated in terms of the ratio of positive integer
numbers, by counting in principle the number of corpuscles in the quantities.
Concluding the argument, two homogeneous quantities seem to be
always commensurable.

From this point of view the process of measurement becomes that of

(i) finding (with a finite procedure) a unit of measure e, possibly the 
largest, common to the quantities to be measured, 

(ii) counting; if the quantity A is n-times e, and a quantity B is me, 
then the relation between A and B is expressed by the quotient of 
the integers n and m. 

In fact the basic proofs of some geometrically relevant facts seem to be 
in favour of the assumption of commensurability. Here are a few examples.^ 

1.1 Theorem (Thales’s theorem). Let ABC and DEF be two trian- 
gles with equal angles. If the segments AB and DE are commensurable 

^ Aristotle (384BC-322BC), Metaphysics.

^ This actually only shows that some geometric constructions preserve rationality of 
the measures of the data. 
Figure 1.2. Area of a rectangle.



with ratio m/n, then the pairs BC-EF and AC-DF are commensurable,
both with ratio m/n.

Proof. Since all angles are equal, possibly after a reflection, translation or rotation, we
can assume that the two triangles have a common angle ∠DEF = ∠ABC, and that
the lines AC and DF are parallel, see Figure 1.1; we can moreover assume that A and
C are interior points of the segments DE and EF. The commensurability assumption
yields a segment e with the property that AB is a multiple of e with a factor m and
DE is a multiple of e with a factor n. This way AB and DE are subdivided respectively
into m and n pieces equal to e. If we draw the parallel lines to AC through the points of
subdivision of DE, we obtain a subdivision of BC and EF respectively into m and n
equal pieces. Such a piece, which is common to BC and EF, is the common measure
we were looking for.

Similarly, we can show that AC and DF are commensurable. □ 



1.2 Theorem (Area of a Rectangle). Let R be a rectangle with sides 
a and b, which are commensurable with ratio m/n with respect to a seg- 
ment e. Then R is commensurable to the square Q of side a with ratio 
m/n. 

Proof. In fact we have a = ne and b = me and, compare Figure 1.2, R = nm e × e and
Q = n² e × e. □

1.3 Remark. It is the commensurability of the sides of a rectangle that 
allows us to measure the area in an elementary way. 




Figure 1.3. Pythagorean theorem: c² + 2ab = (a + b)².






Figure 1.4. Frontispieces of the first printed Greek and Latin editions of the Elements
by Euclid of Alexandria (325BC-265BC).



1.4 Show the geometric form of the Pythagorean theorem. That is, show with a
straight edge and compass construction that, in a right triangle, we can decompose the
squares on the two sides into parts which fit exactly into the square on the hypotenuse.

1.5 Theorem (Pythagorean theorem). Suppose that the sides and
the hypotenuse of a right triangle are commensurable to a segment e with
ratios respectively m/n, p/q and r/s; then

$$\frac{m^2}{n^2} + \frac{p^2}{q^2} = \frac{r^2}{s^2}.$$

Proof. The squares of the sides are commensurable to the square of side e with ratio
respectively m²/n², p²/q² and r²/s². The claim then follows from the geometric version
of the Pythagorean theorem in Exercise 1.4, if we take into account that submultiples
of a given quantity are commensurable. □



b. Never-ending processes 

The Pythagorean assumption that all pairs of homogeneous quantities are
commensurable was probably supported by proofs such as the ones we have
seen in the previous paragraph. The discovery of incommensurable pairs of
segments, such as the side and the diagonal of a square (see Proposition 1.9
of [GM1]), and its disclosure by Hippasus, a member of the Pythagorean
school, produced a deep crisis in the numerical foundations of geometry
and in part of the dominant Greek culture, so much so that it cost
Hippasus himself his life: according to tradition, Hippasus was thrown
overboard by the Pythagoreans. Obviously, it was not only a mathematical
foundation that was failing, but a whole conception of the world, a
conception meant to justify social relationships and cultural superiority.
The procedure for determining a possible unit common to two magnitudes,
such as line segments, angles, areas, can be regarded as the geometric
equivalent of Euclid’s algorithm (see, for example, 8.25). In the case of two
line segments r_0, r_1, we consider the shortest one, r_1, and we cover r_0 with
copies of r_1. If we succeed in covering r_0 perfectly, r_1 is a common unit as
r_0 is a multiple of r_1. Otherwise, we consider the part r_2 which remains
from r_0 after covering it with copies of r_1, and restart the process using
r_2 as the shortest segment between, this time, r_1 and r_2 (see, for example,
Figure 1.5). For the Pythagoreans this process would always stop after a
finite number of steps.^ In fact stopping after a finite number of steps is
exactly equivalent to commensurability. However, the procedure will never
stop in the case of the diagonal and the side of a square, as we have seen,
or of the diagonal and the side of a pentagon (see, for example, Figures
1.6 and 1.7).
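The covering procedure just described can be sketched in a few lines of Python. This is only an illustration under our own assumptions: the function name common_unit and the parameter max_steps are hypothetical, and lengths are represented by exact fractions, so the two segments are commensurable by construction and the process always stops.

```python
from fractions import Fraction

def common_unit(r0, r1, max_steps=1000):
    """Geometric Euclid's algorithm on two positive lengths.

    Repeatedly covers the longer segment with copies of the shorter one
    and continues with the part that remains. Returns the common unit,
    or None if the process has not stopped after max_steps steps.
    """
    longer, shorter = max(r0, r1), min(r0, r1)
    for _ in range(max_steps):
        remainder = longer % shorter   # what is left after covering
        if remainder == 0:
            return shorter             # shorter measures both segments
        longer, shorter = shorter, remainder
    return None

unit = common_unit(Fraction(21, 10), Fraction(6, 5))
print(unit, Fraction(21, 10) / unit, Fraction(6, 5) / unit)  # 3/10 7 4
```

For the diagonal and the side of a square no exact fractional representation exists, so the sketch cannot be applied to that pair; this limitation reflects precisely the incommensurability discussed above.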

The existence of incommensurable pairs made it necessary to face pro- 
cesses that were treacherous as they did not stop after a finite number of 
steps, and to give up the idea of controlling continuous geometrical entities 
by rational numbers or finite processes. 

These reasons probably led Eudoxus of Cnidus (408BC-355BC) to 
introduce the notion of magnitude as opposed to numbers and develop a 
theory of comparison of magnitudes: the theory of proportions which is 
presented in Book V of Euclid’s Elements. This, together with the method
of exhaustion, due also to Eudoxus, is among the greatest achievements of
Euclidean geometry. The method of exhaustion is presented in Book XII 
of Euclid’s Elements and finds its splendour with Archimedes of Syracuse 
(287BC-212BC) and, later, with mathematicians in the Renaissance, as 
for example Francesco Maurolico (1494-1575). 

Eudoxus’s idea is to consider equal ratios of pairs of magnitudes with- 
out any reference to numbers. Nonending processes this way disappear 
and we capture their essence via the method of exhaustion. Using modern 
notation and numbers in a nonessential way, we can state 



^ In this context one can think of Zeno’s paradoxes (about 495BC).
Figure 1.6. Denote by P_n, n = 1, 2, ..., the n-th pentagon from the left to the right
and by a_n and d_n respectively the lengths of its side and diagonal. Then we have
d_{n+1} = a_n and a_n = d_{n+1} = a_{n+1} + a_{n+2}. The figure shows that the process of
construction of pentagons never ends; a_1 and d_1 are therefore incommensurable (see,
for example, Chapter 3).



1.6 Definition (Exhaustion principle). The magnitudes a and b are
in the same ratio as the magnitudes A and B if, given arbitrarily two
positive numbers m and n, we have

ma < nb if and only if mA < nB, 

ma > nb if and only if mA > nB. 

Of course the previous criterion requires “infinitely many comparisons 
of capability,” but it provides a firm foundation of geometry; for instance, 
the proofs of Theorems 1.1, 1.2 and 1.5 extend easily to cover “irrational 
ratios.” Though historically not correct, we can think of the exhaustion 
method as a method for approximating irrational numbers by rationals.^ 
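To make the role of the “infinitely many comparisons” concrete, here is a minimal Python sketch that tests the criterion of Definition 1.6 for integers m and n up to a chosen bound. The function name first_disagreement and the parameter bound are our own hypothetical choices. A finite search of this kind can refute equality of two ratios, but it can never prove it.

```python
def first_disagreement(a, b, A, B, bound):
    """Search m, n in 1..bound for a comparison on which the pairs differ.

    Returns (m, n) if 'm*a < n*b' and 'm*A < n*B' disagree, or if
    'm*a > n*b' and 'm*A > n*B' disagree; returns None otherwise.
    """
    for m in range(1, bound + 1):
        for n in range(1, bound + 1):
            if (m * a < n * b) != (m * A < n * B):
                return (m, n)
            if (m * a > n * b) != (m * A > n * B):
                return (m, n)
    return None

print(first_disagreement(1, 2, 3, 6, 50))   # None: no comparison separates 1:2 from 3:6
print(first_disagreement(1, 2, 3, 5, 50))   # (2, 1): 2*1 > 1*2 fails, but 2*3 > 1*5 holds
```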

c. Back to numbers 

In Medieval times the centrality of the numbers came up again because of 
the new trading. The algebra brought from the Arab world by Leonardo 
Pisano (1170-1250), called Fibonacci, took a relevant role in the new com- 
mercial companies: any good is homogeneous to any other good, money is 
the unit of measure to which every quantity has to be referred. New prob- 
lems, which require numerical solutions, arose and the continuity problem 
came again as the problem of finding square or cubic roots. Scientists rec- 
ognized the irrational character of those numbers, but, unlike the Greeks, 
they learned how to live with them. 

They avoided asking themselves about the nature of these new numbers 
and were satisfied with approximations whenever irrationals appeared as 
solutions of problems. 



^ Notice that, if the ratios of magnitudes are numbers, the exhaustion principle is a
process that leads to the equality a/b = A/B and amounts to showing that

$$\left|\frac{a}{b} - \frac{A}{B}\right| < \frac{1}{n} \qquad \forall n \in \mathbb{N},\ n \ge 1.$$
Figure 1.7. The side and the diagonal of a pentagon are incommensurable.



For centuries irrational numbers were used. Meanwhile other numbers
appeared. In the sixteenth century the Italian mathematicians Niccolò
Fontana (1500-1557), called Tartaglia, Girolamo Cardano (1501-1576) and
Rafael Bombelli (1526-1573) used even imaginary numbers in solving algebraic
equations of third and fourth degree, and François Viète (1540-1603)
introduced literal calculus. The bursting impact of the infinitesimal calculus
led to include even “infinity” and “infinitesimal” among numbers. Of
course the development of mathematics, especially in the sixteenth and
seventeenth centuries, did not go without criticism, but in some sense,
D’Alembert’s attitude allez de l’avant: la foi vous viendra mattered more.

At the beginning of the nineteenth century, Augustin-Louis Cauchy
(1789-1857) tried to give solid bases to infinitesimal calculus, founding it
on the theory of limits, which he rigorously developed in two celebrated
treatises: the Cours d’Analyse and the Résumé des leçons sur le calcul
infinitésimal, respectively in 1821 and 1823. However, in this process of
revision he found a series of difficulties that could be overcome, as we
have seen in [GM1], only after a rigorous settlement of the system of real
numbers.

It was only fifty years later, in 1872, that Georg Cantor (1845-1918)
and Richard Dedekind (1831-1916) formulated the axiom of continuity
(see, for example, Section 1.2) and built a model of real numbers in the
celebrated works Über die Ausdehnung eines Satzes aus der Theorie der
trigonometrischen Reihen and Stetigkeit und irrationale Zahlen.

The system of numbers needed should be a minimal extension of the 
rationals, so that each number could be approximated by rationals. Also, 
in such a system, we should be able to compare, sum and multiply as one 
does with the geometrical continuum, but without any reference to it. 

The crucial property singled out by Dedekind to capture the intuition 
of the continuity of the line was that, in every division of the line into two 
classes of points such that every point in one class is to be to the left of 
each point in the second, there is one and only one point that produces 
the division. He carried this idea over to the existence of the supremum of 
every nonempty subset that is bounded from above. 
Figure 1.8. Frontispieces of Johannes Herwangen Editio Princeps in Greek and Latin
of the works of Archimedes of Syracuse (287BC-212BC), Basel 1594, and of one of the 
Oxford editions, Oxford 1696. 



d. An axiomatic or a constructive approach? 

The clarification of the mystery of the continuity of the real line due to 
Georg Cantor (1845-1918) and Richard Dedekind (1831-1916) turned out 
to be simple and consistent with the way mathematicians had dealt with 
real numbers in those years. However, the question of the existence of such 
a system of numbers still held. Actually, immediately after Dedekind’s 
works, other models of real numbers appeared. They were built starting 
from the rationals as, for instance, the one due to Karl Weierstrass (1815- 
1897). The idea that became dominant from then on was the following. 
Starting from the rationals one adds new numbers such that, if one chooses 
a reference on the line, they will occupy the holes left out by the ratio- 
nals. Then, by using the possibility of approximating the new numbers 
with rational numbers, the operations already defined on the rationals are 
extended to the former as well. 

The constructive approach brings back the existence of the system of 
real numbers, i.e., the consistency of such a system, to the consistency of 
the rationals and therefore to the one of natural numbers, in a process 
of “arithmetization of mathematics” typical of the so-called Berlin school 
around the middle of the nineteenth century, well expressed by the famous 
words of Leopold Kronecker (1823-1891) : 

Natural numbers are the work of God, all else is the work of man.

Actually this process is not at all simple and requires a theory of sets, that 
is quite abstract and complex. Furthermore, it turns out that within such a 
Figure 1.9. Richard Dedekind (1831-1916) and Georg Cantor (1845-1918).



theory, one cannot establish whether these systems are consistent (Gödel’s
theorem, Kurt Gödel (1906-1978)); in other words, one cannot establish
whether the assumption that a system enjoys a number of properties will
lead or not to unpleasant surprises: this is the question of the foundations
of mathematics (see, for example, Section 3.3.2).

We chose in [GM1], and we shall keep to our choice in Section 1.2,
an axiomatic approach to real numbers: we take for granted that there 
is a system of numbers that enjoys the properties it is expected to have, 
and within this system we shall find the subsets of rational and natural 
numbers. 



1.2 The Axiomatic Approach to Real Numbers

In this section we discuss the axioms of the system of real numbers and 
some of their consequences. For the sake of convenience we deal with alge- 
braic and order properties in Section 1.2.1, and with the continuity prop- 
erty in Section 1.2.2. 



1.2.1 Algebraic and order properties 

The algebraic properties of real numbers are conveniently subsumed in a
minimal number of axioms, which are enough to allow us to derive all the
usual rules of computation.
a. Axioms for addition

An operation of sum is defined in the system of real numbers R: it associates
to each pair of numbers x and y their sum, denoted by x + y. In
other words a function is defined, the sum, + : R × R → R, (x, y) → x + y,
x, y ∈ R. We assume that

(A1) Addition is associative: (x + y) + z = x + (y + z) for all x, y, z ∈ R.

(A2) Existence of zero: R has an element, denoted by 0, such that 0 + x =
x + 0 = x for every x ∈ R.

(A3) Existence of the opposite: to every x ∈ R corresponds an element
y ∈ R such that x + y = y + x = 0.

(A4) Addition is commutative: x + y = y + x for all x, y ∈ R.

From (A1), . . . , (A4) we infer, for example,

(i) The number 0 in (A2), called zero or neutral element for the addition,
is unique. In fact, for another 0' we infer 0 = 0 + 0' = 0' by applying
(A2) to 0, (A4) and again (A2) to 0'.

(ii) The opposite y in (A3) to x is unique. In fact, if for y, z ∈ R we had
x + z = x + y = 0, then z = z + 0 = z + (x + y) = (z + x) + y = 0 + y = y
by applying (A2), (A3), (A1), (A3) together with (A4), and again (A2). The
opposite of x is usually denoted by −x and one writes x − y instead of x + (−y).
The new operation (x, y) → x − y is then called subtraction.

Whenever in a set X an operation with the properties (A1), . . . , (A4)
is defined, we say that X is a commutative group. In this case, (i) and (ii)
above read: in a commutative group there is a unique neutral element,
and every x has a unique opposite. Axioms (A) for the reals can therefore
be summarized by saying that R is a commutative group with respect to
addition.
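On a finite set the properties (A1), . . . , (A4) can be verified by exhaustive checking. The following Python sketch does this; the function name is_commutative_group is our own hypothetical choice, and the examples use the integers modulo 5, which are not discussed in the text and serve only as an illustration.

```python
from itertools import product

def is_commutative_group(elements, op):
    """Check axioms (A1)-(A4) by brute force on a finite set."""
    elements = list(elements)
    # The operation must not lead outside the set.
    if any(op(x, y) not in elements for x, y in product(elements, repeat=2)):
        return False
    # (A1) associativity
    if any(op(op(x, y), z) != op(x, op(y, z))
           for x, y, z in product(elements, repeat=3)):
        return False
    # (A2) existence of a neutral element
    zeros = [e for e in elements
             if all(op(e, x) == x and op(x, e) == x for x in elements)]
    if not zeros:
        return False
    zero = zeros[0]
    # (A3) every element has an opposite
    if any(all(op(x, y) != zero or op(y, x) != zero for y in elements)
           for x in elements):
        return False
    # (A4) commutativity
    return all(op(x, y) == op(y, x) for x, y in product(elements, repeat=2))

print(is_commutative_group(range(5), lambda x, y: (x + y) % 5))  # True
print(is_commutative_group(range(5), lambda x, y: (x * y) % 5))  # False: 0 has no reciprocal
```

Removing 0 from the second example gives a commutative group again, in analogy with what happens for multiplication of real numbers.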



b. Axioms for multiplication 

A second operation, called multiplication, (x, y) → xy, ∀x, y ∈ R, is assumed
on R. It satisfies the following axioms:

(M1) Multiplication is associative: (xy)z = x(yz) for all x, y, z ∈ R.

(M2) Existence of identity: R contains an element, denoted by 1, such
that 1 ≠ 0 and 1x = x1 = x for every x ∈ R.

(M3) Existence of the reciprocal: to each x ∈ R, x ≠ 0, corresponds an
element w ∈ R such that wx = xw = 1.

(M4) Multiplication is commutative: xy = yx for all x, y ∈ R.

Similarly to addition one easily proves that the identity is unique and
the reciprocal of each element is unique. Usually one denotes by x⁻¹, 1/x or
by $\frac{1}{x}$ the reciprocal of x ≠ 0. We emphasize that 0⁻¹ or 1/0 is not defined
and it is meaningless. It is easily seen that R \ {0} is a commutative group
with respect to multiplication.
Figure 1.10. Frontispieces of A treatise of Algebra by John Wallis (1616-1703) and of
Artis analyticae praxis by Thomas Herriot (1560-1621) where probably the symbols < 
and > first appear. 



1.7 Show that rotations of the plane around a given point form a commutative
group, the operation of sum of two rotations, respectively of angles x and y, being
defined as the rotation of angle x + y. Since we can clearly identify rotations of the
plane with the unit circle in R², we can say in a fancy way that the circle has the
structure of a commutative group.

1.8 ¶. Show that rotations of the space around a given point form a group which is
however not commutative, that is, rules (A1), (A2), (A3), but not (A4) hold. Again in a
fancy way we can say that the unit sphere in has the structure of a noncommutative 
group. 



c. The distributive law 

The next axiom defines the relationship between the operations of sum 
and multiplication. 

(AM) x(y + z) = xy + xz holds for all x, y, z ∈ R.

All algebraic rules of computation follow from the axioms (A), (M) 
and (AM). 

1.9 ¶. Show that

(i) 0 · x = 0,

(ii) (−x)(−y) = xy,

(iii) (−x)y = −(xy),

(iv) (x − y)z = xz − yz,

(v) xy = 0 if and only if either x = 0 or y = 0.

[Hint: As an example let us prove (i). We have 0 · x + x = 0x + 1x [by (M2)] =
(0 + 1)x [by (AM) and (M4)] = 1x [by (A2)] = x [by (M2)]. Adding −x to both sides
we then infer 0x = 0x + (x + (−x)) = (0x + x) + (−x) [by (A1)] = x + (−x) = 0 [by (A3)].]
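In the same spirit as the hint, here is one possible derivation of (iii), and then of (ii), from the axioms. It is a sketch of one route among several and is not necessarily the argument the exercise has in mind; it uses (i) and the uniqueness of the opposite proved above.

```latex
\begin{align*}
(-x)y + xy &= \bigl((-x) + x\bigr)y && \text{by (AM) and (M4)} \\
           &= 0 \cdot y             && \text{by (A3)} \\
           &= 0                     && \text{by (i)}.
\end{align*}
% Hence (-x)y is an opposite of xy and, by uniqueness of the opposite,
% (-x)y = -(xy), which is (iii).
%
% For (ii) we apply (iii) twice, together with (M4) and the fact that
% -(-a) = a (a is the opposite of -a, since a + (-a) = 0):
\begin{align*}
(-x)(-y) = -\bigl(x(-y)\bigr) = -\bigl((-y)x\bigr) = -\bigl(-(yx)\bigr) = yx = xy.
\end{align*}
```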






Figure 1.11. Augustin-Louis Cauchy (1789-1857) and the frontispiece of his
Cours d’Analyse, 1821.
If x, y ∈ R, x ≠ 0, the quotient of y by x is defined as

$$\frac{y}{x} := y\,\frac{1}{x} = y x^{-1}.$$

We also write y/x for x ≠ 0. The ordinary rules of computation of
fractions, as for instance

$$\frac{a}{b}\,\frac{c}{d} = \frac{ac}{bd}, \qquad \frac{a}{b} + \frac{c}{d} = \frac{ad + bc}{bd},$$

follow easily from the axioms for multiplication. We repeat: dividing by
zero is not allowed.



d. Order 

We can identify in R a subset P, called the subset of positive numbers, by
means of the following two axioms:

(O1) If x, y are positive numbers, x, y ∈ P, then x + y and xy ∈ P.

(O2) For each x ∈ R only one of the following three alternatives holds:
x ∈ P, x = 0 or −x ∈ P.

(O1) and (O2) imply that 1 is positive. In fact, since 1 ≠ 0, either 1 or
−1 is positive and, as 1 = 1² = (−1)², we conclude that 1 is positive. A
nonzero number which is nonpositive is called negative. We write x > 0
to say that x is positive, while x > y or y < x mean that x − y is positive.
Consequently x < 0 means that x is negative, and x negative, x < 0, is
equivalent to −x is positive, −x > 0. One can show that if x, y are negative,
Figure 1.12. Frontispieces of the Fondamenti per la teorica delle funzioni di variabili
reali by Ulisse Dini (1845-1918) and of the Italian translation by Oscar Zariski (1899- 
1986) of Stetigkeit und irrationale Zahlen by Richard Dedekind (1831-1916). 



then xy is positive. In fact xy = (−x)(−y) and −x and −y are positive.
In particular the square of a nonzero real number is positive.

From the previous axioms it is not difficult to infer the usual rules to
deal with inequalities:

(i) if x < y and y < z, then x < z,

(ii) if x < y and z > 0, then xz < yz,

(iii) if x < y and z ∈ R, then x + z < y + z,

(iv) if x < y and x > 0, then 1/y < 1/x,

(v) if x < y and z < 0, then xz > yz.

Finally, since 1 is positive, also 2 := 1 + 1, 3 := 1 + 1 + 1, 1 + 1 + ··· + 1,
and so on are positive.



1.2.2 Continuity property 

a. Supremum 

Let A be a nonempty subset of R. We recall (see, for example, Section 1.1
of [GM1])

◦ c ∈ R is an upper bound of A if A ⊂ ]−∞, c], i.e., if x ≤ c ∀x ∈ A,
◦ A is bounded above if it has an upper bound, i.e., if there exists c ∈ R
such that x ≤ c ∀x ∈ A,
◦ c is the greatest element of A, or a maximum of A, if c is an upper bound
of A and c ∈ A,
◦ c ∈ R is a lower bound of A if x ≥ c ∀x ∈ A,
◦ A is bounded below if it has a lower bound, i.e., if there exists c ∈ R such
that x ≥ c ∀x ∈ A,
◦ c ∈ R is the least element of A, or a minimum of A, if c is a lower bound
and c ∈ A,
◦ c is the infimum of A if c is the greatest lower bound.

1.10 Definition. Let A be a nonempty subset of R bounded from above. 
The least upper bound of A, in short the l.u.b. of A, is also called the 
supremum of A and denoted by sup A. 

Whenever they exist, the maximum and the supremum are unique;
moreover, if both exist, then they agree. Clearly the supremum is characterized
by the following

1.11 Proposition. Let A ⊂ R, A ≠ ∅. L ∈ R is the supremum of A if
and only if

(i) L is an upper bound of A, i.e., x ≤ L ∀x ∈ A,

(ii) ∀ε > 0 L − ε is not an upper bound of A, i.e., ∀ε > 0 ∃x ∈ A such
that x > L − ε.
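As an illustration of conditions (i) and (ii), consider the set A = {1 − 1/n : n = 1, 2, ...}, which is our own example and does not appear in the text. Every element of A is at most 1, and the Python sketch below produces, for a given ε > 0, an element of A larger than 1 − ε; the function name witness is a hypothetical choice. Under these assumptions the proposition yields sup A = 1.

```python
from fractions import Fraction
from math import floor

def witness(eps):
    """Return an element of A = {1 - 1/n : n = 1, 2, ...} larger than 1 - eps."""
    n = floor(1 / Fraction(eps)) + 1    # n > 1/eps, hence 1/n < eps
    return 1 - Fraction(1, n)

for eps in (Fraction(1, 2), Fraction(1, 10), Fraction(1, 1000)):
    x = witness(eps)
    print(eps, x, 1 - eps < x <= 1)
```

Since no element of A equals 1, this set has a supremum but no maximum.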

The axiom of continuity of the reals is then (see, for example, Section 1.1
of [GM1])

(C) Every nonempty subset of R that is bounded above has a least upper
bound.

1.12 Remark. If c is an upper bound of A, then every number larger 
than c is again an upper bound of A. We are tempted to say that the 
upper bounds of A form a half-line, but to identify it we need the left 
extremal point! A geometric way of visualizing the axiom of continuity is 
exactly saying that if A ⊂ R is nonempty and bounded above, then all
upper bounds of A are given by the numbers in [sup A, +∞[.

1.13 Example. If A = ]−∞, c], c is the maximum of A. All upper bounds of A are
given by the numbers in the closed half-line [c, +∞[, and c = sup A.

If A = ]−∞, c[, A has no maximum, all upper bounds of A are again the numbers in
the closed half-line [c, +∞[, and c is the supremum of A. In both cases the set of upper
bounds is given by the closed half-line [c, +∞[, and the l.u.b. is c, therefore it exists.

Similarly we have 

1.14 Proposition. Let A ⊂ R, A ≠ ∅. L ∈ R is the infimum of A if and
only if

(i) L is a lower bound of A, i.e., L ≤ x ∀x ∈ A,

(ii) ∀ε > 0 L + ε is not a lower bound of A, i.e., ∀ε > 0 ∃x ∈ A
such that x < L + ε.



Equivalently, the axiom of continuity can be restated as 
(C1) Every nonempty subset A ⊂ R which is bounded below has a greatest
lower bound.



b. The extended real line 

As we have seen in [GM1], it is convenient to introduce the symbols +∞,
−∞, R̄ := R ∪ {+∞, −∞} and write sup A = +∞ if A is not bounded
above, and inf A = −∞ if A is not bounded below. With these agreements
the axiom of continuity transforms into

(C2) Every nonempty subset of R has supremum and infimum.



c. Dedekind cuts of R 

The continuity property was stated by Richard Dedekind (1831-1916) in 
terms of cuts. 

1.15 Definition. Let X be a set in which the axioms (A), (M), (AM)
and (O) hold. A cut (A, B) of X is a subdivision of X in nonempty subsets
A and B such that A ∪ B = X, A ∩ B = ∅ and

∀a ∈ A and ∀b ∈ B we have a < b.

If (A, B) is a cut of X, we say that x ∈ X corresponds to (A, B) or that
it brings about this cut if a ≤ x ≤ b ∀a ∈ A, ∀b ∈ B.

Clearly the element that brings about a cut is unique, if it exists, and
belongs either to A or to B.

1.16 Theorem. Let X satisfy the axioms (A), (M), (AM) and (O). The
following 

(i) the axiom of continuity (C) holds in X, 

(ii) to every cut of X corresponds an element of X 

are equivalent. 

Following Dedekind, we can then state the axiom of continuity also as 

(C3) To every cut of X corresponds an element of X. 

Proof of Theorem 1.16. (i) $\Rightarrow$ (ii). Let $(A, B)$ be a cut in $X$. Clearly $A$ is bounded
above, since every $b \in B$ is an upper bound of $A$. Set $x_0 := \sup A$. We show that $x_0$
brings about the cut $(A, B)$. Since $x_0$ is an upper bound, we have $a \le x_0$ for all $a \in A$.
Since $x_0$ is the least upper bound of $A$, $x_0 \le b$ for all $b \in B$.

(ii) $\Rightarrow$ (i). Let $E$ be a nonempty subset of $X$ that is bounded above. Denote by $M(E)$
the set of upper bounds of $E$. Clearly $A := X \setminus M(E)$ and $B := M(E)$ form a cut $(A, B)$
of $X$. Denote by $x_0$ the element of $X$ corresponding to $(A, B)$. We then show that $x_0$
is an upper bound of $E$. Otherwise, there is an $x \in E$ such that $x > x_0$ and choosing
$x_1 := (x_0 + x)/2$, we get $x_1 \in M(E)$ since $x_1 > x_0$ and $x_1 \notin M(E)$ since $x_1 < x \in E$,
a contradiction. Since moreover $x_0 \le b$ $\forall b \in M(E)$, $x_0$ is the l.u.b. of $E$. □

The axiom of continuity is not valid in $\mathbb{Q}$. For example, if $A :=
\{x \in \mathbb{Q} \mid 0 \le x^2 < 2\}$, $A$ is nonempty, bounded above, and $\sup A = \sqrt{2} \in \mathbb{R}$;
but, since $\sqrt{2}$ is not in $\mathbb{Q}$, $A$ has no supremum in $\mathbb{Q}$.



1.17 The existence of the $n$-th root of a nonnegative real number is a consequence
of the continuity of the function $x^n$, $x \ge 0$ (see, for example, 2.46 of [GM1]). Relying
on the axiom of continuity, prove that every nonnegative real number has an $n$-th root.



1.2.3 Uniqueness of reals 

We have already hinted at the fact that it cannot be decided whether the 
system of reals is consistent or not. Another important question is the 
uniqueness of such a system. This is not clear a priori; both the rationals 
and the reals satisfy the algebraic and order axioms, but $\mathbb{Q} \ne \mathbb{R}$. Fortunately,
two numerical systems S and T which satisfy the algebraic, order
and continuity axioms are indistinguishable. Let us be more precise on
this point. 

A set S with two operations, called addition and multiplication, which 
satisfy the axioms (A), (M), (AM), and (O), is called an ordered field. For
example R and Q are ordered fields. An ordered field is said to be complete 
if the axiom of continuity holds. 

An algebraic isomorphism $f : S \to S'$ is a bijective correspondence
between $S$ and $S'$ that is compatible with the operations of addition and
multiplication on $S$ and $S'$, that is, such that

$$f(x +_S y) = f(x) +_{S'} f(y), \qquad f(x \cdot_S y) = f(x) \cdot_{S'} f(y)$$

for all $x, y \in S$. We say that the isomorphism is order preserving when
$f(x)$ is positive in $S'$ if and only if $x$ is positive in $S$, i.e.,

$$x <_S y \ \text{ if and only if } \ f(x) <_{S'} f(y) \quad \forall x, y \in S.$$

Finally, we say that the ordered fields $S$ and $S'$ are isomorphic if there is
an algebraic isomorphism $f : S \to S'$ that preserves the order. We then
have 

1.18 Theorem. Every complete ordered field S is isomorphic to R, con- 
sequently all complete ordered fields are isomorphic. 

Proof. Since $S$ is complete, any of its nonempty subsets that is bounded above has supremum
in $S$. Also, if $f : S \to S'$ is an isomorphism between two complete ordered fields $S$ and $S'$,
then we have $f(\sup E) = \sup f(E)$.

We can now easily construct a bijection $f$ between $\mathbb{N}$ and a subset of $S$
which preserves operations and order (compare the next section concerning the subset of
naturals in $\mathbb{R}$); we can also extend this bijection to a bijection
$f$ of $\mathbb{R}$ onto a subset $S' \subset S$. To conclude, it suffices to prove that $f(\mathbb{R}) = S$. Suppose
$f(\mathbb{R}) \ne S$, i.e., that there is $\bar{x} \in S \setminus S'$, let $\mathbb{Q}' := f(\mathbb{Q})$, and consider the sets

$$\{x \in \mathbb{Q}' \mid x < \bar{x}\} \quad \text{and} \quad \{x \in \mathbb{Q}' \mid x > \bar{x}\}.$$

They are clearly nonempty, otherwise $\mathbb{Q}'$ would be bounded below or above. If

$$\ell' := \sup\{x \in \mathbb{Q}' \mid x < \bar{x}\} \quad \text{and} \quad L' := \inf\{x \in \mathbb{Q}' \mid x > \bar{x}\},$$

we have $\ell', L' \in S'$ and

$$\ell' < \bar{x} < L',$$

otherwise $\bar{x} \in S'$. Therefore we conclude that there is no $p \in \mathbb{Q}'$ such that $\ell' < p < L'$;
this is a contradiction, as it would imply that there are no rationals between $\ell$ and $L$,
where $\ell' =: f(\ell)$, $L' =: f(L)$. □



1.3 Natural Numbers 

In this section we identify the subset of R of natural numbers. 



a. Natural numbers and the principle of induction 

We commonly say that natural numbers are the numbers 0, 1, 2, 3, and 
so on, actually meaning that there is a never-ending rule producing all 
natural numbers which is 

(i) 0 is a natural number, 

(ii) if $x$ is a natural number, then adding 1 produces the next natural
number, the “successor” $x + 1$ of $x$.

Even more, we intend that this rule generates only natural numbers. 

To be more precise, let us state first 

1.19 Definition. A subset $A \subset \mathbb{R}$ is said to be inductive if

(i) $0 \in A$,

(ii) if $x \in A$, then $x + 1 \in A$.

The entire $\mathbb{R}$, the half-lines $[-1, +\infty[$ and $[0, +\infty[$, the subset

$$\{0, 1/2, 1, 3/2, 2, 5/2, 3, \dots\}$$

are examples of inductive subsets of R. The naive way to describe the 
naturals suggests that 

(i) the set of natural numbers is inductive, 

(ii) no proper subset of the naturals is inductive, 

(iii) the subset of naturals is the smallest inductive subset of R. 

For these reasons we set 

1.20 Definition. N is the smallest inductive subset of R. 

A trivial consequence is the following. 

1.21 Proposition (Induction principle). If $A \subset \mathbb{N}$ is inductive, then
$A = \mathbb{N}$.

The Definition 1.20 can be justified by naive set theory. In fact we can 
define N as the intersection of all inductive subsets of R, 



$$\mathbb{N} := \bigcap \{A \subset \mathbb{R} \mid A \text{ is inductive}\}.$$



This way the existence of N leads to set theory. Then we need to show that 
N is inductive itself. Consequently N exists and is the smallest inductive 
subset of R. 



1.22 ¶¶. Show that $\mathbb{N} := \bigcap \{A \subset \mathbb{R} \mid A \text{ inductive}\}$ is inductive.

To be consistent, it remains to show that the operations of R, when 
restricted to N, yield the usual operations on N. This is summarized in the 
following 

1.23 Proposition. We have: 

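(i) if $n \in \mathbb{N}$, then $n + 1 \in \mathbb{N}$,

(ii) if $n, m \in \mathbb{N}$, then $n + m$ and $nm \in \mathbb{N}$,

(iii) if $n \in \mathbb{N}$ and $n > 0$, then $n - 1 \in \mathbb{N}$,

(iv) if $n, m \in \mathbb{N}$ and $|n - m| < 1$, then $n = m$,

(v) every nonempty subset $A \subset \mathbb{N}$ has a minimum,

(vi) a subset $A \subset \mathbb{N}$ is bounded if and only if it has a maximum.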

Proof. Notice that in principle $n + 1$, $n + m$, $nm$ are real numbers.

(i) It is trivial, since $\mathbb{N}$ is inductive.

(ii) For $n \in \mathbb{N}$ set $A_n := \{m \in \mathbb{N} \mid n + m \in \mathbb{N}\}$. It is easily seen that $A_n$ is an inductive
subset of $\mathbb{R}$. Thus by the induction principle $A_n = \mathbb{N}$, that is, $n + m \in \mathbb{N}$ $\forall m$ and fixed
$n$. The claim follows since $n$ is arbitrary. One can argue similarly for the product of
naturals.

(iii) The set $A := \{0\} \cup \{n \in \mathbb{N} \mid n - 1 \in \mathbb{N}\}$ is inductive, hence $A = \mathbb{N}$, in particular
$n - 1 \in \mathbb{N}$ if $n \ne 0$.

(iv) We claim that the set $A := \{n \in \mathbb{N} \mid \nexists\, m \in \mathbb{N},\ n < m < n + 1\}$ is inductive. In fact,
if we found a natural number $m$ with $0 < m < 1$, then $m - 1 < 0$ and (iii) would give
$m - 1 \in \mathbb{N}$, i.e., a contradiction. Similarly, we show that if there is no natural number
between $n \in \mathbb{N}$ and $n + 1$, then there is no natural number between $n + 1$ and $n + 2$.
By the induction principle, $A = \mathbb{N}$.

(v) Let $\ell := \inf A$. If $\ell$ is not the minimum of $A$, we can find (by the properties of the
infimum) $x, y \in A \subset \mathbb{N}$ with $\ell < y < x < \ell + 1/2$. A contradiction to (iv).

(vi) Let $A \subset \mathbb{N}$ be bounded and let $\ell := \sup A \in \mathbb{R}$. Suppose $\ell$ is not a maximum, then
there are $n, m \in A$ such that $\ell - 1 < n < m < \ell$. Since $A \subset \mathbb{N}$, we reach a contradiction
to (iv). □



1.24 Axiomatic definition of naturals. Natural numbers can also be defined
axiomatically, independently from the reals. They are a set $\mathbb{N}$ with a map $\sigma : \mathbb{N} \to \mathbb{N}$,
called successor, satisfying the following five axioms:

(i) $0 \in \mathbb{N}$,

Figure 1.13. Giuseppe Peano (1858-1932) and the frontispiece of Arithmetices Principia, Torino, 1889.

(ii) if $a \in \mathbb{N}$, then $\sigma(a) \in \mathbb{N}$,

(iii) if $a \in \mathbb{N}$ and $a = \sigma(b)$, then $a \ne 0$,

(iv) $\sigma$ is injective, i.e., if the successors of $a$ and $b$ are equal, then so are $a$ and $b$,

(v) if $A \subset \mathbb{N}$ is such that $0 \in A$, and $a \in A$ implies $\sigma(a) \in A$, then $A = \mathbb{N}$.

Axioms (i)-(v) were introduced by Giuseppe Peano (1858-1932), who also showed how
one can derive from them the entire arithmetic: they are known as Peano's axioms.
Starting from natural numbers one can build successively the system of signed integers,
denoted by $\mathbb{Z}$, of rationals $\mathbb{Q}$ and of reals $\mathbb{R}$.

From Proposition 1.23 (vi) we in particular infer

1.25 Theorem (Archimedean property). $\mathbb{N}$ is not bounded above in
$\mathbb{R}$, i.e., given any $M \in \mathbb{R}$ there exists $n \in \mathbb{N}$ such that $n > M$.

It is convenient to state the Archimedean property of R in several 
equivalent forms. 

1.26 Proposition. The following equivalent claims hold:

(i) if $M > 0$, then there exists $n \in \mathbb{N}$ such that $n > M$,

(ii) (Archimedean property) if $x, y \in \mathbb{R}$ are positive numbers, then
there is $n \in \mathbb{N}$ such that $nx > y$,

(iii) for every $\epsilon > 0$ there exists $\nu \in \mathbb{N}$ such that $1/\nu < \epsilon$,

(iv) if $x \in \mathbb{R}$ is such that $|x| < 1/n$ $\forall n \in \mathbb{N}$, $n \ge 1$, then $x = 0$.

Proof. (i) is true by Theorem 1.25.

(i) $\Rightarrow$ (ii). It suffices to apply (i) with $M := y/x$.

(ii) $\Rightarrow$ (iii). It suffices to apply (ii) with $y := 1$ and $x := \epsilon$.

(iii) $\Rightarrow$ (iv). If $x \ne 0$, we apply (iii) with $\epsilon := |x|$ and find $\nu$ such that $1/\nu < |x|$: a
contradiction.

Figure 1.14. Rationals are dense.



(iv) $\Rightarrow$ (i). Were $\mathbb{N}$ bounded above, we would find $L$ such that $L > n$ $\forall n \in \mathbb{N}$, hence
$1/L < 1/n$ for all $n \ge 1$ and, according to (iv), $1/L = 0$: a contradiction. □

We repeat: the possibility of dividing an interval of R in subintervals 
as small as we want is equivalent to the unboundedness of N in R. 

b. Approximation of reals by rational numbers 

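Starting from the natural numbers we define the (relative) integer numbers
as

$$\mathbb{Z} := \{x \in \mathbb{R} \mid |x| \in \mathbb{N}\}$$

and the rational numbers as

$$\mathbb{Q} := \Big\{x \in \mathbb{R} \ \Big|\ x = \frac{p}{q},\ p, q \in \mathbb{Z},\ q \ne 0\Big\}.$$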



1.27 Definition. We say that $A \subset \mathbb{R}$ is dense in $\mathbb{R}$ if for any pair of
distinct real numbers $x, y \in \mathbb{R}$, $x < y$, there is $a \in A$ such that $x < a < y$.

Clearly the following claims are equivalent:

o $A \subset \mathbb{R}$ is dense in $\mathbb{R}$,

o if $\epsilon > 0$, $x \in \mathbb{R}$, then there is $a \in A$ such that $|x - a| < \epsilon$,

o if $n \in \mathbb{N}$, $n \ge 1$, and $x \in \mathbb{R}$, then there is $a \in A$ such that $|x - a| < 1/n$.

1.28 Theorem. The subset of rationals Q C R is dense in R. 

Proof. Let $n \in \mathbb{N}$, $n \ge 1$, and let us prove that, if $x \ge 0$, then there is a rational
$r \ge 0$ such that $|r - x| < 1/n$.

(i) If $0 \le x < 1/n$, we take $r := 0$ as $|x - 0| = x < 1/n$.

(ii) If $x \ge 1/n$, we define, compare Figure 1.14,

$$A := \{m \in \mathbb{N} \mid m/n \le x\}.$$

Since $1 \in A$, as $1/n \le x$, $A$ is nonempty.
Moreover, $A$ is bounded above (by $nx$), hence it has a maximum $k$ by Proposition 1.23 (vi).
We must have $k \le nx < k + 1$, hence, for $r := k/n$,

$$0 \le x - \frac{k}{n} < \frac{1}{n}.$$

Finally, if $x < 0$ we find $r \ge 0$, $r \in \mathbb{Q}$, such that $|-x - r| < 1/n$, hence
$|x - (-r)| = |-x - r| < 1/n$. □

1.29 Theorem. The subset of decimal fractions, written as
$\{m/10^n \mid m \in \mathbb{Z},\ n \in \mathbb{N}\}$, and, more generally, the subset of fractions
of the type $\{m/p^n \mid m \in \mathbb{Z},\ n \in \mathbb{N}\}$, where $p \in \mathbb{N}$, $p > 1$, is dense in $\mathbb{R}$.

1.30 ¶. Prove Theorem 1.29. [Hint: Compare the proof of Theorem 1.28 and use that,
if $n \in \mathbb{N}$ and $p \ge 2$, then $p^n > n$.]

The last theorem says that if $x \in \mathbb{R}$, then we can find a finite decimal
expansion $m/10^n$ as close to $x$ as we want. More precisely, if we fix the
deviation $\epsilon > 0$ and apply Theorem 1.29 in the interval $]x - \epsilon, x + \epsilon[$, we find
an approximate finite decimal expansion $m/10^n$ with $|x - m/10^n| < \epsilon$.
Notice also that not every rational is a finite decimal: for example 1/3
cannot be expressed as a decimal fraction.
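
The approximation by fractions of the type $m/p^n$ can be made explicit: taking $m := \lfloor p^n x \rfloor$ gives $0 \le x - m/p^n < p^{-n}$. The following Python sketch illustrates this under stated assumptions: the function name `base_p_approximation` and the use of exact `Fraction` arithmetic are choices made here for illustration and do not come from the text.

```python
from fractions import Fraction
from math import floor


def base_p_approximation(x: Fraction, n: int, p: int = 10) -> Fraction:
    """Return m / p**n with m = floor(p**n * x), so that 0 <= x - m/p**n < 1/p**n."""
    scale = p ** n
    m = floor(x * scale)  # largest integer m with m/scale <= x
    return Fraction(m, scale)


x = Fraction(1, 3)  # 1/3 is not a decimal fraction, but it can be approximated by them
for n in range(1, 6):
    r = base_p_approximation(x, n)
    # the error is nonnegative and smaller than 10**(-n)
    assert 0 <= x - r < Fraction(1, 10 ** n)
    print(n, r, float(x - r))
```

The error never reaches zero for $x = 1/3$, in agreement with the remark that 1/3 is not a finite decimal.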

c. Recursive statements 

The induction principle has also the following useful formulation. 

1.31 Proposition. Suppose that for every natural number $n \in \mathbb{N}$ we are
given a statement $p(n)$.

(i) Suppose that the statement $p(0)$ is known to be true.

(ii) Suppose that for any $n$, if the statement $p(n)$ happens to be true,
then the statement $p(n + 1)$ must also be true.

Then the statement $p(n)$ must be true for all $n$.

Proof. Proposition 1.31 is quite convincing: it is equivalent to the induction principle.
In fact the assumptions (i) and (ii) just say that the set $A := \{n \in \mathbb{N} \mid p(n) \text{ is true}\}$ is
inductive, hence $A = \mathbb{N}$ by the induction principle. □

Of course we also have 

1.32 Proposition. Suppose that for every $n$ we are given a statement
$p(n)$.

(i) Suppose that there is $k \in \mathbb{N}$ such that $p(k)$ is true.

(ii) Suppose that for all $n \ge k$, if the statements $p(k), p(k + 1), \dots, p(n)$
happen to be true, then $p(n + 1)$ must also be true.

Then the statement $p(n)$ must also be true for all $n \ge k$.

1.33 Example. We show that $2^n > n$ $\forall n \ge 0$. Let $p(n)$ be the statement “$2^n > n$.”
We have

(i) $p(0)$ = “$2^0 = 1 > 0$” is true.

(ii) From “$2^0 = 1 > 0$” by adding 1 to both sides we then infer $2^1 = 1 + 1 > 1 + 0 = 1$,
i.e., $p(1)$ is true and, for $n \ge 1$, from $2^n > n$ we get $2^{n+1} = 2 \cdot 2^n > 2n \ge n + 1$, i.e., $p(n + 1)$
is true.

Proposition 1.31 then yields the estimate $2^n > n$ for all $n$.







1.34 Example (Bernoulli's inequality). We can give a proof of Bernoulli's inequality,
$(1 + h)^n \ge 1 + nh$ for $h \ge -1$ and $n \in \mathbb{N}$ (see, for example, 5.52 of [GM1]), that makes
no use of calculus. In fact for $n = 0$ we have $(1 + h)^0 = 1 = 1 + 0 \cdot h$. If now $n \in \mathbb{N}$ and
we assume $(1 + h)^n \ge 1 + nh$, then

$$(1 + h)^{n+1} = (1 + h)(1 + h)^n \ge (1 + h)(1 + nh) \quad [\text{since } h \ge -1]$$
$$= 1 + (n + 1)h + nh^2 \ge 1 + (n + 1)h.$$



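1.35 Example (Arithmetic and quadratic mean). Let us show by induction that

$$\Big(\sum_{j=1}^{n} a_j\Big)^2 \le n \sum_{j=1}^{n} a_j^2.$$

For $n = 1$ the claim is trivial. Suppose the claim true for $n$ and let us prove it for $n + 1$.
We have

$$\Big(\sum_{j=1}^{n+1} a_j\Big)^2 = \Big(\sum_{j=1}^{n} a_j\Big)^2 + 2a_{n+1} \sum_{j=1}^{n} a_j + a_{n+1}^2. \qquad (1.1)$$

From the inequality $2\alpha\beta \le \epsilon\alpha^2 + \beta^2/\epsilon$, which holds for all $\alpha, \beta \in \mathbb{R}$ and $\epsilon > 0$, we infer,
for $\epsilon := 1/n$, that

$$2a_{n+1} \sum_{j=1}^{n} a_j \le n\, a_{n+1}^2 + \frac{1}{n} \Big(\sum_{j=1}^{n} a_j\Big)^2. \qquad (1.2)$$

Formulas (1.1), (1.2) and the inductive assumption then yield

$$\Big(\sum_{j=1}^{n+1} a_j\Big)^2 \le \Big(1 + \frac{1}{n}\Big) \Big(\sum_{j=1}^{n} a_j\Big)^2 + (n + 1) a_{n+1}^2 \le (n + 1) \sum_{j=1}^{n} a_j^2 + (n + 1) a_{n+1}^2 = (n + 1) \sum_{j=1}^{n+1} a_j^2.$$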



1.36 Example (Sum of the first n naturals). There is a closed formula for the
sum $S_1(n)$ of the first $n$ naturals,

$$S_1(n) := 1 + 2 + \dots + n = \sum_{j=1}^{n} j = \frac{n(n + 1)}{2}. \qquad (1.3)$$

This can be proved in several ways.

(i) Writing

$$S_1(n) = 1 + 2 + \dots + (n - 1) + n,$$
$$S_1(n) = n + (n - 1) + \dots + 2 + 1,$$

and summing we get

$$2 S_1(n) = (n + 1) + (n + 1) + \dots + (n + 1) = n(n + 1).$$

(ii) Arranging squares of side 1 as in Figure 1.15, the total area of the shadowed squares
is $n(n + 1)/2$.

Figure 1.15. $\sum_{j=1}^{n} j = n(n + 1)/2$.

(iii) Using the identity $(a + b)^2 = a^2 + 2ab + b^2$, we can write

$$1^2 = 0^2 + 2 \cdot 0 + 1$$
$$2^2 = 1^2 + 2 \cdot 1 + 1$$
$$3^2 = 2^2 + 2 \cdot 2 + 1$$
$$\dots$$
$$n^2 = (n - 1)^2 + 2 \cdot (n - 1) + 1$$
$$(n + 1)^2 = n^2 + 2 \cdot n + 1$$

and, summing,

$$1^2 + 2^2 + \dots + (n + 1)^2 = 1^2 + 2^2 + \dots + n^2 + 2 S_1(n) + (n + 1),$$

that is, $2 S_1(n) = (n + 1)^2 - (n + 1) = n(n + 1)$.

(iv) By induction: the sequence $x_n := n(n + 1)/2$, $n \ge 1$, satisfies the recursion

$$\begin{cases} x_1 = 1, \\ x_{n+1} = x_n + (n + 1), & \forall n \ge 1, \end{cases}$$

which defines $S_1(n)$.

1.37 Example. The sum of the first $n$ odd naturals is

$$\sum_{j=1}^{n} (2j - 1) = 2 \sum_{j=1}^{n} j - n = n(n + 1) - n = n^2,$$

see Figure 1.16.

Figure 1.17. The Josephus problem for n = 5, 8.


1.38 Example (Sum of the squares of the first n naturals). There is a closed
formula for

$$S_2(n) := 1 + 4 + 9 + \dots + n^2 = \sum_{j=1}^{n} j^2.$$

In fact the method (iii) in Example 1.36 extends to the present case. Using the identity
$(a + b)^3 = a^3 + 3a^2 b + 3ab^2 + b^3$ we write

$$1^3 = 0^3 + 3 \cdot 0^2 + 3 \cdot 0 + 1$$
$$2^3 = 1^3 + 3 \cdot 1^2 + 3 \cdot 1 + 1$$
$$3^3 = 2^3 + 3 \cdot 2^2 + 3 \cdot 2 + 1$$
$$\dots$$
$$n^3 = (n - 1)^3 + 3 \cdot (n - 1)^2 + 3 \cdot (n - 1) + 1$$
$$(n + 1)^3 = n^3 + 3 \cdot n^2 + 3 \cdot n + 1.$$

Summing, we then get

$$1^3 + 2^3 + \dots + (n + 1)^3 = 1^3 + 2^3 + \dots + n^3 + 3 S_2(n) + 3 S_1(n) + (n + 1),$$

that is

$$3 S_2(n) = (n + 1)^3 - (n + 1) - 3 S_1(n),$$

i.e., because of the value of $S_1(n)$ in Example 1.36,

$$\sum_{j=1}^{n} j^2 = \frac{n(n + 1)(2n + 1)}{6}.$$

We can also prove it by induction observing that the sequence $x_n := n(n + 1)(2n + 1)/6$,
$n \ge 1$, satisfies the recursion

$$\begin{cases} x_1 = 1, \\ x_{n+1} = x_n + (n + 1)^2, & \forall n \ge 1, \end{cases}$$

that defines $S_2(n)$.
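
The two recursions above lend themselves to a quick numerical check. The following Python sketch, with function names chosen here purely for illustration, accumulates the sums term by term and compares them with the closed formulas for $S_1(n)$ and $S_2(n)$; it is a sanity check, not a replacement for the inductive proofs.

```python
def s1_closed(n: int) -> int:
    """Closed formula n(n+1)/2 for 1 + 2 + ... + n."""
    return n * (n + 1) // 2


def s2_closed(n: int) -> int:
    """Closed formula n(n+1)(2n+1)/6 for 1 + 4 + ... + n^2."""
    return n * (n + 1) * (2 * n + 1) // 6


def check_closed_forms(limit: int) -> bool:
    """Follow the recursions x_{n+1} = x_n + (n+1) and x_{n+1} = x_n + (n+1)^2 up to `limit`."""
    s1 = s2 = 0
    for n in range(1, limit + 1):
        s1 += n        # running value of S_1(n)
        s2 += n * n    # running value of S_2(n)
        if s1 != s1_closed(n) or s2 != s2_closed(n):
            return False
    return True


print(check_closed_forms(1000))
```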

1.39 Example (The Josephus problem). Consider the following variant of a story
told by Flavius Josephus, a Jewish historian of the first century. We consider $n$ people
arranged in a circle and numbered from 0 to $n - 1$, and, starting with the person labelled 1,
we eliminate every second remaining person until only one survives, see Figure 1.17. We
are asked to determine the position $T(n)$ of the survivor.

We easily see that $T(1) = 0$, $T(2) = 0$, $T(3) = 2$, $T(4) = 0$. For large $n$ we may
argue recursively. If the number of people is even, $n = 2p$, after the first round only the
even-numbered people survive and the next to be eliminated is labelled 2. We are therefore
in the situation of $p$ people numbered $0, 2, \dots, 2p - 2$. In formula,

$$T(2p) = 2T(p).$$

If $n = 2p + 1$ is odd, after one round $p$ people labelled $2, 4, \dots, 2p - 2, 2p$ survive and
the next to be eliminated is person number 4. Hence

$$T(2p + 1) = 2T(p) + 2.$$

Therefore the sequence of the $T(n)$ satisfies the recurrence

$$\begin{cases} T(1) = 0, \\ T(2n) = 2T(n), & \forall n \ge 1, \\ T(2n + 1) = 2T(n) + 2, & \forall n \ge 1. \end{cases} \qquad (1.4)$$

This is the table of $T(n)$ for $n = 1, \dots, 16$.

n      1  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16
T(n)   0  0  2  0  2  4  6  0  2  4  6  8 10 12 14  0

This suggests for $T(n)$ the closed form

$$T(n) = 2(n - 2^k), \quad \text{if } 2^k \le n < 2^{k+1}. \qquad (1.5)$$

This is in fact the case, as one proves by checking that the sequence defined by (1.5) satisfies
the recurrence (1.4).
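
The three descriptions of $T(n)$ in this example (direct elimination, the recurrence (1.4), the closed form (1.5)) can be compared numerically. The Python sketch below is only an illustration; the function names are hypothetical and the simulation assumes the people stand in a circle, with person 0 skipped first and person 1 eliminated first.

```python
def josephus_simulation(n: int) -> int:
    """Simulate the elimination on people 0..n-1, eliminating person 1 first."""
    people = list(range(n))
    idx = 0  # index of the person who is skipped next (person 0 at the start)
    while len(people) > 1:
        idx = (idx + 1) % len(people)  # the person after the skipped one is eliminated
        people.pop(idx)
        idx %= len(people)             # the following person is the next to be skipped
    return people[0]


def josephus_recurrence(n: int) -> int:
    """Evaluate T(n) through the recurrence (1.4)."""
    if n == 1:
        return 0
    p, is_odd = divmod(n, 2)
    return 2 * josephus_recurrence(p) + (2 if is_odd else 0)


def josephus_closed_form(n: int) -> int:
    """Evaluate T(n) = 2(n - 2^k) with 2^k <= n < 2^(k+1), as in (1.5)."""
    k = n.bit_length() - 1  # largest k with 2^k <= n
    return 2 * (n - 2 ** k)


table = [josephus_simulation(n) for n in range(1, 17)]
print(table)
```

The printed list can be compared entry by entry with the table of $T(n)$ for $n = 1, \dots, 16$.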



1.4 Summing Up 

Real Numbers 

The system of real numbers $\mathbb{R}$ is defined axiomatically as a set of objects satisfying a
suitable family of rules. First, we can operate on it with addition, multiplication and
order in the usual way: this is summarized by saying that $\mathbb{R}$ is an ordered field. Secondly,
a “continuity property,” that can be expressed in several equivalent forms, holds.

o Let $A \subset \mathbb{R}$ be nonempty. The supremum of $A$, denoted by $\sup A$, is the least upper
bound of $A$, that is, the unique number $L \in \mathbb{R}$ such that

(i) $L$ is an upper bound of $A$, i.e., $x \le L$ $\forall x \in A$,

(ii) $\forall \epsilon > 0$ $L - \epsilon$ is not an upper bound of $A$, i.e., $\forall \epsilon > 0$ $\exists x \in A$ such that $x > L - \epsilon$.

o Let $A \subset \mathbb{R}$ be nonempty. The infimum of $A$, denoted by $\inf A$, is the greatest lower
bound of $A$, that is, the unique number $\ell \in \mathbb{R}$ such that

(i) $\ell$ is a lower bound of $A$, i.e., $x \ge \ell$ $\forall x \in A$,

(ii) $\forall \epsilon > 0$ $\ell + \epsilon$ is not a lower bound of $A$, i.e., $\forall \epsilon > 0$ $\exists x \in A$ such that $x < \ell + \epsilon$.

o A cut $(A, B)$ of $\mathbb{R}$ is a subdivision of $\mathbb{R}$ in nonempty subsets $A$ and $B$ such that
$A \cup B = \mathbb{R}$, $A \cap B = \emptyset$ and

$$\forall a \in A \text{ and } \forall b \in B \text{ we have } a < b.$$

If $(A, B)$ is a cut of $\mathbb{R}$, we say that $x \in \mathbb{R}$ corresponds to $(A, B)$ if $a \le x \le b$ $\forall a \in A$,
$\forall b \in B$.

The axiom of continuity of the reals can be expressed by one of the following
equivalent statements:

o every nonempty subset $A \subset \mathbb{R}$ that is bounded above has supremum, $\sup A \in \mathbb{R}$,
o every nonempty subset $A \subset \mathbb{R}$ that is bounded below has infimum, $\inf A \in \mathbb{R}$,
o to every cut $(A, B)$ of $\mathbb{R}$ corresponds an element of $\mathbb{R}$.

Natural Numbers

A set $A \subset \mathbb{R}$ is inductive if $0 \in A$ and, if $x \in A$, then $x + 1 \in A$. The set of natural
numbers is the subset of $\mathbb{R}$ defined by

o $\mathbb{N}$ is the smallest inductive subset of $\mathbb{R}$.

Relevant facts about naturals are the following:

o Induction principle If $A \subset \mathbb{N}$ is inductive, then $A = \mathbb{N}$.

o Induction principle Suppose that for every natural number $n \in \mathbb{N}$ we are given a
statement $p(n)$ and let $k \in \mathbb{N}$.

(i) Suppose that the statement $p(k)$ is known to be true.

(ii) Suppose that for any $n \ge k$, if the statement $p(n)$ happens to be true, then the
statement $p(n + 1)$ must also be true.

Then the statement $p(n)$ must be true for all $n \ge k$,
o Archimedean property $\mathbb{N}$ is not bounded above in $\mathbb{R}$,
o every nonempty subset $A \subset \mathbb{N}$ has a minimum,
o a subset $A \subset \mathbb{N}$ is bounded if and only if it has a maximum.



Rationals 

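The integers $\mathbb{Z}$ and the rational numbers $\mathbb{Q}$ are defined respectively by

$$\mathbb{Z} := \{x \in \mathbb{R} \mid |x| \in \mathbb{N}\}, \qquad \mathbb{Q} := \Big\{x \in \mathbb{R} \ \Big|\ x = \frac{p}{q},\ p, q \in \mathbb{Z},\ q \ne 0\Big\}.$$

o $\mathbb{Q}$ and the irrationals $\mathbb{R} \setminus \mathbb{Q}$ are dense in $\mathbb{R}$.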



1.5 Exercises 



1.40 Show that $\mathbb{R}_+ := \{x \in \mathbb{R} \mid x > 0\}$ is a multiplicative group. Establish an
isomorphism of groups between the additive group $\mathbb{R}$ and the multiplicative group $\mathbb{R}_+$.



1.41 ¶. Let $X$ be an ordered field. Show

Proposition. Let $A \subset X$. $A$ has a maximum if and only if $A$ is nonempty, bounded
above, has supremum and $\sup A \in A$. In this case $\max A = \sup A$.

1.42 ¶. Show that

(i) $\inf A \le \sup A$.

(ii) If $\emptyset \ne A \subset B \subset \mathbb{R}$, then $\inf B \le \inf A \le \sup A \le \sup B$.

(iii) Let $A, B \subset \mathbb{R}$ be such that $a \le b$ for all $a \in A$, $b \in B$. Then $\inf A \le \inf B$ and
$\sup A \le \sup B$.

1.43 ¶. Given $A, B \subset \mathbb{R}$ and $\gamma \in \mathbb{R}$, $\gamma > 0$, define

$$A + B := \{x \in \mathbb{R} \mid x = a + b,\ a \in A,\ b \in B\},$$
$$A \cdot B := \{x \in \mathbb{R} \mid x = ab,\ a \in A,\ b \in B\},$$
$$\gamma A := \{x \in \mathbb{R} \mid x = \gamma a,\ a \in A\},$$
$$-A := \{x \in \mathbb{R} \mid -x \in A\}.$$

Show that

$$\sup(A + B) = \sup A + \sup B, \qquad \inf(A + B) = \inf A + \inf B.$$

If moreover $A, B \subset \mathbb{R}_+$, then

$$\sup(A \cdot B) = \sup A \cdot \sup B, \qquad \inf(A \cdot B) = \inf A \cdot \inf B,$$
$$\sup(\gamma A) = \gamma \sup A, \qquad \inf(\gamma A) = \gamma \inf A,$$
$$\inf(-A) = -\sup A, \qquad \sup(-A) = -\inf A.$$



1.44 ¶. Let $X$ be an ordered field. Show that $\min A = -\max(-A)$, $\sup A = -\inf(-A)$
for all $A \subset X$. Deduce that the axioms (C) and (C1) are equivalent.

1.45 ¶. Prove



Proposition. Let $X \subset \mathbb{R}$ be such that

(i) $0 \in X$,

(ii) if $x \in X$, then $x + 1 \in X$,

(iii) if $0 < x \in X$, then $x - 1 \in X$,

(iv) every nonempty subset $A$ of $X$ has a minimum.

Then $X = \mathbb{N}$.

[Hint: The statements (i) and (ii) say that $X$ is inductive, hence $\mathbb{N} \subset X$. Assume $X \setminus \mathbb{N}$
is nonempty ...]

1.46 Theorem 1.28 says that between two distinct real numbers there is a rational
one. Show that actually there are infinitely many rationals.



1.47 Show that irrational numbers R \ Q are dense in R. [Hint: Proceed similarly 
to the proof of Theorem 1.28.] 

1.48 Show that 2 + >/3 and + \/3 are irrational numbers. 

1.49 Given four rational numbers $a$, $b$, $c$ and $d$ with $ad - bc \ne 0$ and an irrational
number $x$ such that $cx + d \ne 0$, show that $\frac{ax + b}{cx + d}$ is an irrational number.



1.50 Let m,n be natural numbers with y/m irrational. Show that y/m -f is 
irrational for all fc G N. 

1.51 i[. Show that, if a + by/2 + c\^ = 0, then a = 6 = c = 0. 

1.52 Show that $\log_{10} 2$ is irrational.

1.53 ¶. Show that the set of rationals $B := \{q \in \mathbb{Q} \mid q^2 \le 2\}$ has supremum (in $\mathbb{R}$)
given by $\sqrt{2}$.

1.54 ¶ Tarski's paradox. All numbers are equal. We proceed by induction on the
number of numbers. The claim is trivial for one number $a$, $a = a$. Suppose that the
claim is true for 3 numbers and let us prove it for 4 numbers, $a$, $b$, $c$, $d$. We know that
$a = b = c$ and $b = c = d$, hence $a = b = c = d$. By induction the claim is proved. Where
is the error?

1.55 Show Proposition 1.32. 

1.56 ¶. Show that

$$n! > 2^n \quad \forall n \ge 4,$$

2^^ — n > Vn > 5, 

> n! Vn > 1. 

1.57 ¶ Ovals. Ovals are boundaries of convex figures in the plane. Draw in the plane
n ovals. Suppose that each one intersects any other in exactly two points and that no 
more than three ovals meet at the same point. In how many regions is the plane divided 
by the ovals? 

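1.58 ¶. Let $I$ be an interval and let $\varphi : I \to \mathbb{R}$ be convex. Show by induction the
discrete Jensen inequality, compare Proposition 5.62 of [GM1]:

$$\varphi\Big(\sum_{i=1}^{n} \lambda_i x_i\Big) \le \sum_{i=1}^{n} \lambda_i \varphi(x_i) \qquad (1.6)$$

for all nonnegative $\lambda_1, \lambda_2, \dots, \lambda_n$ with $\sum_{i=1}^{n} \lambda_i = 1$ and all $x_1, x_2, \dots, x_n \in I$.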

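1.59 ¶ Lagrange's identity. Show that

$$\Big(\sum_{i=1}^{n} a_i^2\Big) \Big(\sum_{i=1}^{n} b_i^2\Big) - \Big(\sum_{i=1}^{n} a_i b_i\Big)^2 = \frac{1}{2} \sum_{i,j} (a_i b_j - a_j b_i)^2.$$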

1.60 if. Given reals Ai, . . . , An with 0 < Aj < 1, show that nr=i(l“^i) ^ l — IliLi 

1.61 ¶¶. Show that $n! \le ((n + 1)/2)^n$. [Hint: Show that $n!^2 = \prod_{k=1}^{n} k(n + 1 - k)$
and that for all $k$, $1 \le k \le n$, we have $n \le k(n + 1 - k) \le \frac{1}{4}(n + 1)^2$.]

1.62 ¶¶. Let $R$ be a rotation of the plane around the origin of an angle $\alpha$ incommensurable
with $\pi$. Denote by $R^n$ the composition of $R$ with itself, $R^n = R \circ R \circ \dots \circ R$,
$n$ times. Given a point $\theta$ on the unit circle, show that the orbit of $\theta$, i.e.,

$$\{z \mid z = R^n \theta,\ n \in \mathbb{N}\},$$

is dense in the circle.

1.63 ¶¶. Let $\alpha \in \mathbb{R} \setminus \mathbb{Q}$. Show that $\{m\alpha - n \mid m, n \in \mathbb{N}\}$ is dense in $\mathbb{R}$. Deduce that
$\{\sin n \mid n \in \mathbb{N}\}$ is dense in $[-1, 1]$.

1.64 ¶ Galileo. Show that

$$\frac{1}{3} = \frac{1 + 3}{5 + 7} = \frac{1 + 3 + 5}{7 + 9 + 11} = \cdots.$$



1.65 if. Find closed formulas for • 

1.66 ¶. Compute






J + l),Vn > 1, 



J = 1 






E^'0' + l)(-? + 2)(i + 3). 

J = 1 



1.67 ¶ Nicomachus's theorem. Show that

$$\sum_{j=1}^{n} j^3 = \Big(\sum_{j=1}^{n} j\Big)^2, \quad \forall n \ge 1.$$



1.68 ¶ Catalan's identity. Show that

$$\sum_{j=1}^{n} \frac{1}{n + j} = \sum_{j=1}^{2n} (-1)^{j-1} \frac{1}{j}, \quad \forall n \ge 1.$$



1.69 ¶. $n$ straight lines are said to be in a generic position if they intersect each other
at one and only one point. Determine how many regions are delimited by n straight 
lines in generic position in the plane. 



1.70 % Show that (P = 2^. 

Thales of Miletus (624BC-546BC), Pythagoras of Samos (580BC-520BC)
Hippocrates of Chios (470BC-410BC), Hippias of Elis (460BC-400BC), Plato (428BC-347BC), Aristotle (384BC-322BC), Eudemus of Rhodes (350BC-290BC)
Eudoxus of Cnidus (408BC-355BC)
Euclid of Alexandria (325BC-265BC)
Aristarchus of Samos (310BC-230BC), Eratosthenes of Cyrene (276BC-197BC)
Archimedes of Syracuse (287BC-212BC)
Apollonius of Perga (262BC-190BC)
Diocles (240BC-180BC), Hipparchus of Rhodes (190BC-120BC), Zenodorus (200BC-140BC)
Ptolemy (85-165)
Heron of Alexandria (1AD), Nicomachus of Gerasa (60AD-120)
Diophantus of Alexandria (200-284)
Pappus of Alexandria (290-350), Theon of Alexandria (335-395), Diadochus Proclus (411-485), Anicus Boethius (475-524), Eutocius of Ascalon (480-540)
Nicomedes (280BC-210BC)
Menelaus of Alexandria (70AD-130)

Figure 1.18. A table of Greek Mathematicians.






2. Sequences of Real Numbers 



As we have seen, we can represent any real number, for instance $\sqrt{2}$, by
its successive approximations with rational numbers, $q_1, q_2, \dots$. According
to Greek mathematicians the process which generates the approximations
$q_1, q_2, \dots$ never ends; for us, instead, such a process is the realization of
$\sqrt{2}$ as the limit of the sequence $\{q_n\}$. In this chapter we shall discuss the
notions of sequence and of limit of a sequence.

In Section 2.1 we discuss basic properties. They may be inferred from
analogous properties for limits of functions proved in [GM1]. However, we
supply direct proofs for two reasons: first to be self-contained and, secondly,
because one may want to discuss limits of sequences before limits of functions.
In Section 2.2 we discuss the important notion of Cauchy sequence,
we prove the Bolzano-Weierstrass theorem and give various equivalent
formulations of continuity of the reals. In Section 2.3 we give alternative
simple proofs of the intermediate value and Weierstrass theorems. Finally,
in Section 2.4 we discuss a few examples, and in Section 2.5 we give an
alternative definition, in terms of sequences, i.e., using just continuity, of the
exponential and logarithmic functions.



2.1 Sequences 

2.1 Definition. A sequence with values in a set $X$, or simply a sequence
in $X$, is a function $x : \mathbb{N} \to X$.

A sequence is denoted by $\{x_n\}$, $n \ge 0$, or by $\{x_n\}_{n \in \mathbb{N}}$; $x_n$, that is $x(n)$,
is referred to as the $n$-th term of the sequence $\{x_n\}$. Accordingly, any
enumeration of points of $X$ by means of an index, which varies in an infinite
subset of the integers, is called a sequence, too: for instance we say “the
sequence $1/n$ with $n$ odd” for $\{x_n\}_{n \in \mathbb{N}}$ with $x_n = 1/(2n + 1)$.

There are many ways to produce sequences. For example we can give 
a formula to compute Xn for all n, as 

1 ^ -}- sin(l/n) 

Xn — 5 ^ ^ I 5 ~ j 

n n\ 



n > 1 
or, and this is in many respects more interesting as we shall see later in
Chapter 8, we can give a rule which gives each term of the sequence in
terms of the preceding terms as in

$$\begin{cases} x_0 = 1, \\ x_{n+1} = f(x_n), & \forall n \ge 0, \end{cases} \qquad (2.1)$$

where $f : \mathbb{R} \to \mathbb{R}$ is a given function. In this case we can compute

$$x_0 = 1, \qquad x_1 = f(x_0) = f(1), \qquad x_2 = f(x_1) = f(f(1)), \qquad \dots$$

and it appears clearly that (2.1) defines uniquely the sequence $\{x_n\}$. Actually,
this is a consequence of the induction principle: the set

$$A := \{n \in \mathbb{N} \mid x_n \text{ is defined by (2.1)}\}$$

is inductive, hence $A = \mathbb{N}$, i.e., $\{x_n\}$ is uniquely defined for all $n \in \mathbb{N}$. It
is usual to refer to (2.1) as the recursive definition of $\{x_n\}$, or to $\{x_n\}$ as a
recursive sequence.



2.2 Example (Integer powers). If $q \in \mathbb{R}$ and $n \in \mathbb{N}$, then $q^n$ is defined as the
product of $q$ by itself $n$ times,

$$q^n := q\,q\,q \cdots q, \quad n \text{ times}.$$

However this is a costly definition: we need to recompute with an increasing number
of multiplications every time we increase the exponent. A simpler way to define all
expressions $q^n$, $n \in \mathbb{N}$, is by the recursive definition

$$\begin{cases} q^0 = 1, \\ q^{n+1} = q\,q^n & \forall n \ge 0. \end{cases} \qquad (2.2)$$

2.3 Example (Products). Let $\{a_n\}$, $n \ge 1$, be a sequence of real numbers. The
product of the first $n$ terms of the sequence is

$$\prod_{i=1}^{n} a_i := a_1 a_2 \cdots a_n.$$

The sequence $x_n := \prod_{j=1}^{n} a_j$, $n \ge 1$, is clearly defined recursively by

$$\begin{cases} x_1 := a_1, \\ x_{n+1} := a_{n+1} x_n & \forall n \ge 1. \end{cases} \qquad (2.3)$$

Figure 2.1. Pascal's triangle.



2.4 Example (Factorial). For all $n \in \mathbb{N}$, the factorial $n!$ of $n$ is defined as 1 if $n = 0$
and as the product of the first $n$ natural numbers

$$n! := n(n - 1)(n - 2) \cdots 3 \cdot 2 \cdot 1$$

if $n \ge 1$. All factorials are, in fact, defined by

$$\begin{cases} x_0 = 1, \\ x_{n+1} = (n + 1) x_n. \end{cases}$$
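
Recursive definitions such as those of integer powers and factorials translate directly into an iteration. The following Python sketch is an illustration under assumptions made here: the generator name `recursive_sequence` is hypothetical, and the step function is allowed to depend on the index $n$ as well as on $x_n$, which is slightly more general than the rule $x_{n+1} = f(x_n)$.

```python
from itertools import islice
from typing import Callable, Iterator


def recursive_sequence(x0: float, step: Callable[[int, float], float]) -> Iterator[float]:
    """Yield x_0, x_1, x_2, ... where x_{n+1} = step(n, x_n)."""
    n, x = 0, x0
    while True:
        yield x
        x = step(n, x)  # one multiplication or function call per new term
        n += 1


# Factorials: x_0 = 1, x_{n+1} = (n + 1) x_n.
factorials = list(islice(recursive_sequence(1, lambda n, x: (n + 1) * x), 6))

# Powers of q = 3: q^0 = 1, q^{n+1} = q q^n.
powers_of_3 = list(islice(recursive_sequence(1, lambda n, x: 3 * x), 5))

print(factorials)
print(powers_of_3)
```

Each new term costs a single step, which is the saving over recomputing the full product described in the example on integer powers.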

2.5 Example (Sums). The sum of the first $n + 1$ terms of a sequence $\{a_n\}_{n \ge 0}$ of
real numbers, $a_0 + a_1 + \dots + a_n$, is denoted by

$$\sum_{j=0}^{n} a_j.$$

The sequence $s_n := \sum_{j=0}^{n} a_j$ is defined by

$$\begin{cases} s_0 = a_0, \\ s_{n+1} = s_n + a_{n+1} & \forall n \ge 0. \end{cases}$$

The integer variable $j$, which varies from 0 to $n$, just enumerates the
elements to be summed: it is a bound variable. We clearly have

$$\sum_{j=0}^{n} a_j = \sum_{k=0}^{n} a_k = \sum_{j+2=0}^{n} a_{j+2} = \sum_{j=2}^{n+2} a_{j-2}.$$




2.6 Example (Binomial coefficients and Newton's binomial). The binomial
coefficients (see, for example, Chapter 4 of [GM1]) are defined by

$$\binom{n}{j} := \frac{n!}{j!\,(n - j)!} = \frac{n(n - 1)(n - 2) \cdots (n - j + 1)}{j!}.$$

It is clearly seen that for $j, n \in \mathbb{N}$ and $0 \le j \le n$ we have

$$\binom{n}{j} = \binom{n}{n - j}, \qquad \binom{n + 1}{j} = \binom{n}{j} + \binom{n}{j - 1} \quad \forall j,\ 0 < j \le n,$$

where the last formula is known as Pascal's formula.

Figure 2.2. Pascal's triangle.

As an application of the induction principle let us give a proof of the binomial
theorem

$$(a + b)^n = \sum_{k=0}^{n} \binom{n}{k} a^{n-k} b^k \quad \forall a, b \in \mathbb{R} \qquad (2.4)$$

(see, for example, 5.53 of [GM1]), which makes no use of calculus.

The claim (2.4) is trivial if either $a$ or $b$ is zero. If, say, $a$ is nonzero, by multiplying
and dividing by $a^n$ we see that, with $h := b/a$, (2.4) is equivalent to

$$(1 + h)^n = \sum_{k=0}^{n} \binom{n}{k} h^k. \qquad (2.5)$$

Therefore it suffices to show that the sequence $x_n := \sum_{j=0}^{n} \binom{n}{j} h^j$ satisfies the same
recursive definition of $(1 + h)^n$, i.e.,

$$\begin{cases} x_0 = 1, \\ x_{n+1} = (1 + h) x_n & \forall n \ge 0. \end{cases}$$

Since in fact $x_0 = \sum_{j=0}^{0} \binom{0}{j} h^j = 1$ and

Figure 2.3. Blaise Pascal (1623-1662) and the frontispiece of his Traité du triangle arithmétique.



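$$(1 + h) x_n = \sum_{j=0}^{n} \binom{n}{j} h^j + \sum_{j=0}^{n} \binom{n}{j} h^{j+1} = \sum_{j=0}^{n} \binom{n}{j} h^j + \sum_{j=1}^{n+1} \binom{n}{j - 1} h^j$$
$$= 1 + \sum_{j=1}^{n} \Big[\binom{n}{j} + \binom{n}{j - 1}\Big] h^j + h^{n+1} = \sum_{j=0}^{n+1} \binom{n + 1}{j} h^j = x_{n+1},$$

on account of Pascal's formula, the claim is proved.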
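
Pascal's formula also gives a practical way to compute binomial coefficients without factorials, one row of the triangle at a time. The Python sketch below is an illustration only; the names `pascal_row` and `binomial_expansion` are hypothetical. It builds the $n$-th row and then evaluates the right-hand side of the binomial theorem so that it can be compared with $(a + b)^n$.

```python
def pascal_row(n: int) -> list[int]:
    """Return [C(n, 0), ..., C(n, n)], built row by row with Pascal's formula."""
    row = [1]
    for _ in range(n):
        # C(m+1, j) = C(m, j) + C(m, j-1) for 0 < j <= m; the two end entries are 1
        row = [1] + [row[j] + row[j - 1] for j in range(1, len(row))] + [1]
    return row


def binomial_expansion(a: int, b: int, n: int) -> int:
    """Evaluate sum_{k=0}^{n} C(n, k) a^(n-k) b^k."""
    return sum(c * a ** (n - k) * b ** k for k, c in enumerate(pascal_row(n)))


print(pascal_row(4))
print(binomial_expansion(2, 3, 5), (2 + 3) ** 5)
```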



a. Limit of a sequence 

The notion of limit of a sequence plays a fundamental role in analysis. 

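2.7 Definition (of limit). Let $\{x_n\}$ be a sequence of real numbers and
let $L \in \mathbb{R}$. We say that $\{x_n\}$ tends to $L$, or converges to $L$, or that $L$ is
the limit of $\{x_n\}$, and we write

$$x_n \to L \quad \text{or} \quad \lim_{n \to \infty} x_n = L$$

if

$$\forall \epsilon > 0 \ \exists\, n_\epsilon \in \mathbb{N} \text{ such that } |x_n - L| < \epsilon \ \ \forall n \ge n_\epsilon. \qquad (2.6)$$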

2.8 Proposition. $1/n \to 0$ and $2^{-n} \to 0$.

2.9 ¶. Show that $x_n \to L$ if and only if $|x_n - L| \to 0$.

2.10 Show that the two claims in Proposition 2.8 are both equivalent to the Archimedean property.

Figure 2.4. Frontispieces of A Treatise concerning the principles of human knowledge and of The Analyst by George Berkeley (1685-1753).
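Definition 2.7 asks, for each $\epsilon > 0$, for an index $n_\epsilon$ beyond which all terms are $\epsilon$-close to the limit. As a small sketch (the helper name `n_epsilon` is an assumption of this illustration), for the sequence $1/n$ with limit $0$ the smallest admissible index can be computed exactly with rational arithmetic:

```python
from fractions import Fraction
from math import floor


def n_epsilon(eps):
    """Smallest index n_eps such that 1/n < eps for every n >= n_eps."""
    eps = Fraction(eps)
    # 1/n < eps  <=>  n > 1/eps, so take the first integer beyond 1/eps
    return floor(1 / eps) + 1


eps = Fraction(1, 10)
n0 = n_epsilon(eps)
print(n0, Fraction(1, n0) < eps)
```

The existence of such an index for every $\epsilon$ is exactly where the Archimedean property enters.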



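2.11 Definition. We say that $\{x_n\}$ diverges or tends to $+\infty$, or that $+\infty$ is the limit of $\{x_n\}$, and we write

\[ x_n \to +\infty \quad \text{or} \quad \lim_{n\to\infty} x_n = +\infty \]

if

\[ \forall M > 0 \ \exists\, \nu \in \mathbb{N} \text{ such that } x_n > M \quad \forall n \ge \nu. \]

We say that $\{x_n\}$ diverges or tends to $-\infty$, or that $-\infty$ is the limit of $\{x_n\}$, and we write

\[ x_n \to -\infty \quad \text{or} \quad \lim_{n\to\infty} x_n = -\infty \]

if

\[ \forall M > 0 \ \exists\, \nu \in \mathbb{N} \text{ such that } x_n < -M \quad \forall n \ge \nu. \]

Finally we say that $\{x_n\}$ has a limit if $\{x_n\}$ converges or diverges.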



b. Properties of limits and calculus 

We may interpret the limit of a sequence as the limit of a function. In fact, given $\{a_n\}$, fix an interval $[x_0, q[$, $q \in \overline{\mathbb{R}}$ (say $[0, +\infty[$), and a strictly increasing sequence $\{x_n\}$ in $[x_0, q[$ with $x_n \to q$ (say $x_n = n$ if $q = +\infty$), and define the step function $\varphi_a : [x_0, q[ \to \mathbb{R}$ by

\[ \varphi_a(x) := a_n \quad \text{if } x_n \le x < x_{n+1}. \]

2.12 Proposition. $a_n \to L \in \overline{\mathbb{R}}$ if and only if $\varphi_a(x) \to L$ as $x \to q^-$.

2.13 Prove Proposition 2.12. 



2.14 ¶. Let $\{a_n\}$, $\{b_n\}$ be two sequences, $\{x_n\}$ as above, and let $\varphi_a$, $\varphi_b$ be the corresponding functions. Show that

o $\varphi_{a+b}(x) = \varphi_a(x) + \varphi_b(x)$,
o $\varphi_{ab}(x) = \varphi_a(x)\,\varphi_b(x)$.

Proposition 2.12 and Exercise 2.14 allow us to specialize properties and results we have already proved for limits of functions to limits of sequences. We however add the simple direct proofs, which are similar to the ones for the limits of functions (see, for example, Chapter 2 of [GM1]). In fact one might want to develop first the theory of limits of sequences and then the theory of limits of functions. As we shall see in Theorem 2.46, the two approaches are completely equivalent.

2.15 Proposition. We have:

(i) (Uniqueness) A sequence cannot have more than one limit.

(ii) (Boundedness) If $\{x_n\}$ converges, then $\{x_n\}$ is bounded.

(iii) (Constancy of sign) Suppose that $\{x_n\}$ has limit $L \in \overline{\mathbb{R}}$.

a) If $L > 0$ (respectively $L < 0$), then there exists $\bar n$ such that $x_n > 0$ (respectively $x_n < 0$) for all $n \ge \bar n$.

b) If there exists $\bar n$ such that $x_n \ge 0$ (respectively $x_n \le 0$) for all $n \ge \bar n$, then $L \ge 0$ (respectively $L \le 0$).

Proof. (i) Suppose $x_n \to L_1$, $x_n \to L_2$, and $L_1 \ne L_2$. If $L_1, L_2 \in \mathbb{R}$, then for $\epsilon := |L_1 - L_2|/2$ we find $x_\nu$ such that $|x_\nu - L_1| < \epsilon$ and $|x_\nu - L_2| < \epsilon$. Therefore

\[ 2\epsilon = |L_1 - L_2| \le |L_1 - x_\nu| + |x_\nu - L_2| < 2\epsilon, \]

a contradiction. The cases in which $L_1$ and/or $L_2$ are infinite are similar.

(ii) Let $\{x_n\}$ converge to $L$. By definition, choosing $\epsilon = 1$, we find $\bar n$ such that $|x_n - L| < 1$ for all $n \ge \bar n$. In particular $|x_n| \le |x_n - L| + |L| < 1 + |L|$ for $n \ge \bar n$. Hence

\[ M := |x_1| + |x_2| + \cdots + |x_{\bar n - 1}| + |L| + 1 \]

is an upper bound for $\{|x_n|\}$.

(iii) Suppose $L > 0$. From the definition of limit with $\epsilon = L/2$, we find $\bar n \in \mathbb{N}$ such that $|x_n - L| < L/2$ for all $n \ge \bar n$, that is, $0 < L/2 = L - L/2 < x_n < 3L/2$, which proves (a). By contradiction one then sees that (b) is equivalent to (a). □



2.16 ¶. Sequences need not have limits. Show that $x_n := (-1)^n$ has no limit as $n \to \infty$.

2.17 ¶. Show that, if $x_n \to L$ and $x_n > 0$ $\forall n$, then $L$ need not be positive.

2.18 Proposition (Squeezing and comparison test). Let $\{a_n\}$, $\{b_n\}$ and $\{c_n\}$ be three sequences. Suppose that there exists $\bar n$ such that $a_n \le b_n \le c_n$ $\forall n \ge \bar n$. If $a_n \to L$ and $c_n \to L$, then $b_n \to L$.

Proof. Suppose $L \in \mathbb{R}$; we leave to the reader the discussion of the cases $L = \pm\infty$. Given $\epsilon > 0$ we find $n_\epsilon$ such that $L - \epsilon < a_n < L + \epsilon$ and $L - \epsilon < c_n < L + \epsilon$ for all $n \ge n_\epsilon$. Since $a_n \le b_n \le c_n$ for all $n \ge \bar n$, we conclude that $L - \epsilon < b_n < L + \epsilon$ for all $n \ge \max(\bar n, n_\epsilon)$, that is $b_n \to L$. □
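As an illustration of the squeezing test, one may take $a_n = -1/n$, $b_n = \sin(n)/n$ and $c_n = 1/n$, so that $L = 0$. The sketch below is only a numerical sanity check under floating-point arithmetic, and the names `squeezed_terms` and `first_index_within` are assumptions made for this example.

```python
import math


def squeezed_terms(n_max):
    """Return the triples (a_n, b_n, c_n) for n = 1..n_max, with a_n <= b_n <= c_n."""
    return [(-1.0 / n, math.sin(n) / n, 1.0 / n) for n in range(1, n_max + 1)]


def first_index_within(eps):
    """An index n_eps with 1/n < eps for all n >= n_eps; then |b_n| < eps as well."""
    return math.floor(1.0 / eps) + 1


eps = 0.01
n0 = first_index_within(eps)
print(n0, all(abs(math.sin(n) / n) < eps for n in range(n0, n0 + 1000)))
```

The index that works for the two outer sequences automatically works for the squeezed one, which is the content of the proof.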

In the applications, Proposition 2.18 is often used in the following version. 



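2.19 Corollary. Let $\{x_n\}$, $\{y_n\}$ be two sequences and let $L \in \mathbb{R}$. If

\[ \exists\, \bar n \text{ such that } |x_n - L| \le y_n \ \forall n \ge \bar n, \quad \text{and} \quad y_n \to 0, \]

then $x_n \to L$. If

\[ \exists\, \bar n \text{ such that } x_n \le y_n \ \forall n \ge \bar n, \quad \text{and} \quad x_n \to +\infty, \]

then $y_n \to +\infty$.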



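2.20 Proposition. Suppose $\{x_n\}$ and $\{y_n\}$ have limits respectively $\ell$ and $m$ in $\overline{\mathbb{R}}$. Then

(i) If $\ell + m$ is well defined in $\overline{\mathbb{R}}$, then $x_n + y_n \to \ell + m$.

(ii) If $\ell m$ is well defined in $\overline{\mathbb{R}}$, then $x_n y_n \to \ell m$.

(iii) If $\ell/m$ is well defined in $\overline{\mathbb{R}}$, then $x_n/y_n \to \ell/m$.

(iv) If $y_n \to 0$ and $y_n > 0$ for all $n$, then $1/y_n \to +\infty$.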



Proof. We prove (i) in the case $\ell, m \in \mathbb{R}$. The reader is asked to discuss the other cases. Given $\epsilon > 0$ we find $n_x$ and $n_y$ such that

\[ |x_n - \ell| < \epsilon \ \text{for all } n \ge n_x, \qquad |y_n - m| < \epsilon \ \text{for all } n \ge n_y, \]

hence for $n \ge \bar n := \max(n_x, n_y)$, the two inequalities $|x_n - \ell| < \epsilon$ and $|y_n - m| < \epsilon$ hold. By the triangle inequality we then infer

\[ |x_n + y_n - \ell - m| \le |x_n - \ell| + |y_n - m| < \epsilon + \epsilon = 2\epsilon \quad \text{for all } n \ge \bar n, \]

which yields the conclusion, since $\epsilon$ is arbitrary.

(ii) If $\ell, m \in \mathbb{R}$ we write

\[ |x_n y_n - \ell m| = |x_n (y_n - m) + m (x_n - \ell)| \le |x_n|\,|y_n - m| + |m|\,|x_n - \ell| < (K + |m|)\,\epsilon \]

where $K$ is an upper bound for $\{|x_n|\}$ (see, for example, Proposition 2.15). This yields the conclusion. The cases $\ell = \pm\infty$ and $m = \pm\infty$ or $m \ne 0$ are simpler.

(iii) If $\ell, m \in \mathbb{R}$, $m \ne 0$, it suffices to notice that

\[ \frac{x_n}{y_n} - \frac{\ell}{m} = \frac{m x_n - \ell m + \ell m - \ell y_n}{m\, y_n} \]

to conclude the proof. (iv) We ask the reader to prove it. □

Figure 2.5. The frontispiece of Exercices d'Analyse et de Physique Mathématique by Augustin-Louis Cauchy (1789-1857).

c. Limits of monotone sequences

2.21 Definition. A sequence $\{x_n\}$ of real numbers is said to be

o bounded above if $\exists\, c \in \mathbb{R}$ such that $x_n \le c$ $\forall n \in \mathbb{N}$,
o bounded below if $\exists\, c \in \mathbb{R}$ such that $x_n \ge c$ $\forall n \in \mathbb{N}$,
o bounded if $\exists\, c \in \mathbb{R}$ such that $|x_n| \le c$ $\forall n \in \mathbb{N}$,
o increasing if $\forall n$ we have $x_n \le x_{n+1}$,
o decreasing if $\forall n$ we have $x_n \ge x_{n+1}$,
o strictly increasing if $\forall n$ we have $x_n < x_{n+1}$,
o strictly decreasing if $\forall n$ we have $x_n > x_{n+1}$,
o monotone if it is increasing or decreasing,
o strictly monotone if it is strictly increasing or strictly decreasing.

Recall that we write

\[ \sup A = +\infty \quad (\text{respectively } \inf A = -\infty) \]

if $A$ is not bounded above (respectively below). An important consequence of the continuity of the reals, on account of Proposition 2.12 above and of Proposition 2.30 of [GM1] or directly, is

2.22 Proposition (Limits of monotone sequences). All monotone sequences have limits. More precisely, if $\{x_n\}$ is increasing, then $x_n \to \sup_n \{x_n\}$, while if $\{x_n\}$ is decreasing, $x_n \to \inf_n \{x_n\}$.

Proof. Suppose $\{x_n\}$ is increasing, and let $L := \sup_n \{x_n\}$, which we assume to be a real number. Given $\epsilon > 0$, the properties of the supremum read

(i) $x_n \le L$ $\forall n$,

(ii) $\exists\, n_\epsilon$ such that $L - \epsilon < x_{n_\epsilon}$.

Since $\{x_n\}$ is increasing, $x_{n_\epsilon} \le x_n$ for all $n \ge n_\epsilon$, hence

\[ L - \epsilon < x_{n_\epsilon} \le x_n \le L \quad \text{for all } n \ge n_\epsilon, \]

that is, $x_n \to L$, since $\epsilon$ is arbitrary. We ask the reader to prove the other cases. □



d. Sequences and supremum 

By comparing the properties that characterize the supremum (or infimum), 
and the definition of limit we readily see 

2.23 Proposition. Let $A \subset \mathbb{R}$ be nonempty. A number $L \in \mathbb{R}$ is the supremum $\sup A$ of $A$ if and only if

(i) $L$ is an upper bound of $A$,

(ii) there exists a sequence $\{x_n\} \subset A$ that converges to $L$.

A number $L \in \mathbb{R}$ is the infimum of $A$ if and only if

(i) $L$ is a lower bound of $A$,

(ii) there exists a sequence $\{x_n\} \subset A$ that converges to $L$.

Notice that $\sup A = +\infty$ if and only if there exists a sequence $\{x_n\}$ with values in $A$ that diverges to $+\infty$; in fact $\sup A = +\infty$ is equivalent to the unboundedness of $A$ from above, i.e., to

\[ \forall n > 0 \ \exists\, x_n \in A \text{ such that } x_n > n. \]

Similarly $\inf A = -\infty$ if and only if there is a sequence of points in $A$ that diverges to $-\infty$. In conclusion, regardless of the boundedness of $A$, provided $A$ is nonempty, we can always claim the existence of a maximizing sequence, i.e., a sequence $\{x_n\} \subset A$ that tends to $\sup A$, and of a minimizing sequence, i.e., a sequence $\{x_n\} \subset A$ that tends to $\inf A$.

2.24 More precisely, prove the following two propositions: 



Proposition. Let A be a nonempty subset of R. Then there exists an increasing se- 
quence {xn} C A that converges to sup A. Moreover we can choose {xn} to be strictly 
increasing if A has no maximum or to be constant if A has maximum. 



Proposition. Every real number is the limit of a monotone sequence of rational num- 
bers. 



e. Subsequences 

Of particular relevance is the notion of subsequence of a sequence. If $\{x_n\}$ is a sequence, a subsequence of $\{x_n\}$ is a new sequence obtained by choosing its values among the values of $\{x_n\}$, however not randomly, but keeping a strict order on the indices.

2.25 Definition. We say that $\{y_n\}$ is a subsequence of $\{x_n\}$ if there is a strictly increasing function $k : \mathbb{N} \to \mathbb{N}$, that is, a sequence of nonnegative integers with $k_1 < k_2 < k_3 < \cdots$, such that

\[ y_n = x_{k_n} \quad \forall n \in \mathbb{N}. \]

2.26 Example. The first terms of the sequence {1/n} are given by 

1, 1/2, 1/3, 1/4, 1/5, 1/6, 1/7,.... 

The terms 1, 1/3, 1/4, 1/7 may be the first terms of a subsequence of {1/n}, while no subsequence of {1/n} can start with 1, 1/3, 1/4, 1/2.
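The condition in Definition 2.25 is a condition on indices, and for the sequence {1/n} it can be checked mechanically, since each value 1/k determines its index k. The sketch below uses a hypothetical helper `can_start_subsequence` introduced only for this illustration.

```python
from fractions import Fraction


def can_start_subsequence(terms):
    """Check whether the given values 1/k can be the first terms of a
    subsequence of {1/n}: recover each index k and test k_1 < k_2 < ..."""
    indices = []
    for t in terms:
        t = Fraction(t)
        if t.numerator != 1 or t.denominator < 1:
            return False  # not a term of {1/n} at all
        indices.append(t.denominator)
    return all(i < j for i, j in zip(indices, indices[1:]))


good = [Fraction(1), Fraction(1, 3), Fraction(1, 4), Fraction(1, 7)]
bad = [Fraction(1), Fraction(1, 3), Fraction(1, 4), Fraction(1, 2)]
print(can_start_subsequence(good), can_start_subsequence(bad))
```

The second list fails because the index 2 would have to come after the index 4.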

2.27 Notice that $k_n \ge n$ $\forall n$, since $k : \mathbb{N} \to \mathbb{N}$ is strictly increasing.

2.28 Proposition. If $\{x_n\}$ has limit $L \in \overline{\mathbb{R}}$, then any subsequence of $\{x_n\}$ has the same limit $L$.

Proof. Suppose $L \in \mathbb{R}$, and let $\epsilon > 0$. By the definition of limit there exists $n_\epsilon$ such that $|x_n - L| < \epsilon$ for all $n \ge n_\epsilon$. Since $k_n \ge n$, we also have $|x_{k_n} - L| < \epsilon$ for all $n \ge n_\epsilon$, i.e., $x_{k_n} \to L$, $\epsilon$ being arbitrary. The proofs in the cases $L = \pm\infty$ are similar. □

In particular if $\{x_n\}$ has two subsequences with different limits, then $\{x_n\}$ has no limit.

The following two examples, though artificially simple, may serve to illustrate the usefulness of Proposition 2.28.

2.29 Example. The sequence $1/n^2$ converges to zero, since it is a subsequence of $\{1/n\}$ (the selection map being $k_n = n^2$). Similarly $1/2^n \to 0$.

2.30 Example. $1/\sqrt{n} \to 0$. In fact, since $\{1/\sqrt{n}\}$ is decreasing it has a limit and $1/\sqrt{n} \to \ell \in \mathbb{R}$. On the other hand $\{1/n\}$ is a subsequence of $\{1/\sqrt{n}\}$ (the selection map being $k_n = n^2$), hence $1/n \to \ell$; consequently $\ell = 0$, since the limit is unique.



2.2 Equivalent Formulations of the 
Continuity Axiom 

a. The principle of nested intervals or Cantor’s principle 

2.31 Theorem (Cantor's intersection theorem). Let $C_n = [a_n, b_n]$ be a sequence of closed intervals of $\mathbb{R}$ such that

\[ [a_{n+1}, b_{n+1}] \subset [a_n, b_n] \quad \forall n \in \mathbb{N}. \]

Then there exists at least a point $x$ common to all intervals, $x \in \bigcap_n C_n$.

Proof. Clearly we have

(i) $a_1 \le a_2 \le a_3 \le \cdots$, i.e., the sequence $\{a_n\}$ is increasing.

(ii) $b_1 \ge b_2 \ge b_3 \ge \cdots$, i.e., the sequence $\{b_n\}$ is decreasing.

(iii) $a_n \le b_m$ for all $n$ and $m$.

Consequently $\{a_n\}$ and $\{b_n\}$ are bounded monotone sequences; by Proposition 2.22 they have finite limits, $a_n \uparrow \ell$, $b_n \downarrow L$, and we have

\[ a_n \le \ell \le L \le b_m \quad \forall n, m \in \mathbb{N}. \]

In particular

\[ [\ell, L] \subset \bigcap_{n=1}^{\infty} C_n. \]

□

2.32 Show that in the proof of Theorem 2.31 we have $[\ell, L] = \bigcap_{n=1}^{\infty} C_n$.



b. Cauchy criterion 

Except for monotone sequences we cannot state that a sequence converges 
without involving its limit in advance. 

2.33 Definition. We say that $\{x_n\}$ is a Cauchy or fundamental sequence if

\[ \forall \epsilon > 0 \ \exists\, \bar n \text{ such that } |x_h - x_k| < \epsilon \quad \forall h, k \ge \bar n. \]

To a given sequence $\{x_n\}$, we associate a new sequence $\{d_n\}$ defined by

\[ d_n := \sup_{h, k \ge n} |x_h - x_k|. \tag{2.7} \]

Notice that $d_k$ can be understood as the length of the interval spanned by all the elements of the sequence $\{x_n\}$ but the first $k$. Definition 2.33 yields

2.34 Proposition. A sequence $\{x_n\}$ is a Cauchy sequence if and only if the corresponding sequence $\{d_n\}$ in (2.7) tends to zero.

If $x_n \to \ell$, then clearly for any $\epsilon > 0$ we have $|x_n - x_m| \le |x_n - \ell| + |x_m - \ell| < \epsilon$ provided $n, m$ are large enough, i.e., $n, m \ge \bar n$ with $\bar n$ chosen in such a way that $|x_n - \ell| < \epsilon/2$ $\forall n \ge \bar n$. In other words: every convergent sequence is a Cauchy sequence.

It is an important fact that the converse holds true.

2.35 Theorem (Cauchy's criterion). A real sequence is convergent if and only if it is a Cauchy sequence.

Proof. It remains to prove that Cauchy sequences are convergent. We first show that Cauchy sequences are bounded. In fact, choosing $\epsilon = 1$, we find $\bar n$ such that

\[ |x_n - x_{\bar n}| < 1 \quad \forall n \ge \bar n. \]

An upper bound for $\{|x_n|\}$ is then given by $|x_1| + |x_2| + \cdots + |x_{\bar n}| + 1$.

Figure 2.6. lim inf and lim sup.

Define now for $n = 1, 2, 3, \dots$

\[ \ell_n := \inf_{k \ge n} \{x_k\}, \qquad L_n := \sup_{k \ge n} \{x_k\}, \]

and observe that $\{\ell_n\}$, $\{L_n\}$ are sequences of real numbers, $\{x_n\}$ being bounded. Moreover $\{\ell_n\}$ is increasing, $\{L_n\}$ is decreasing and $\ell_n \le L_n$ $\forall n \in \mathbb{N}$. By Proposition 2.22 we have $\ell_n \uparrow \ell$, $L_n \downarrow L$, and

\[ \ell_n \le \ell \le L \le L_m \quad \forall n, m \in \mathbb{N}. \]

Since $\{x_n\}$ is a Cauchy sequence and

\[ L_n - \ell_n = \sup_{h, k \ge n} |x_h - x_k|, \]

we get $L_n - \ell_n \to 0$ and therefore $\ell = L =: x \in \mathbb{R}$. Let us finally prove that $x_n \to x$. Fix $\epsilon > 0$ and let $n_\epsilon$ be such that $x - \ell_{n_\epsilon}$ and $L_{n_\epsilon} - x$ are not greater than $\epsilon$. Since for all $n \ge n_\epsilon$ we have $\ell_{n_\epsilon} \le x_n \le L_{n_\epsilon}$, we conclude that

\[ x - \epsilon \le \ell_{n_\epsilon} \le x_n \le L_{n_\epsilon} \le x + \epsilon, \]

i.e., $x_n \to x$ as $n \to \infty$, $\epsilon$ being arbitrary. □



2.36 Remark. As we have seen, every real number is the limit of a strictly increasing (respectively decreasing) sequence of rational numbers. Consider the space of all Cauchy sequences of rational numbers in which we identify two Cauchy sequences $\{p_n\}$ and $\{q_n\}$ if $p_n - q_n \to 0$, i.e., if "they have the same limit." Cauchy's criterion then allows us, essentially, to identify this space with $\mathbb{R}$.

c. Upper and lower limits

Consider any sequence $\{x_n\} \subset \mathbb{R}$. The sequences $\{\ell_n\}$, $\{L_n\}$ previously defined by

\[ \ell_n := \inf_{k \ge n} \{x_k\}, \qquad L_n := \sup_{k \ge n} \{x_k\} \]

are respectively an increasing sequence and a decreasing sequence of extended real numbers. Consequently they have limits in $\overline{\mathbb{R}}$,

\[ \ell_n \to \sup_k \ell_k, \qquad L_n \to \inf_k L_k. \]

We set

\[ \liminf_{n\to\infty} x_n := \lim_{n\to\infty} \ell_n = \lim_{n\to\infty} \inf_{k \ge n} \{x_k\}, \]

\[ \limsup_{n\to\infty} x_n := \lim_{n\to\infty} L_n = \lim_{n\to\infty} \sup_{k \ge n} \{x_k\}, \]

and refer to them respectively as the lower and upper limit, or the limit inferior and the limit superior, of $\{x_n\}$. These new notions will be very useful in the sequel; here we confine ourselves to a few comments.
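To get a feeling for the sequences $\ell_n$ and $L_n$, consider the illustrative choice $x_n = (-1)^n (1 + 1/n)$, for which the lower and upper limits are $-1$ and $1$. The sketch below is a hedged illustration only: a computer sees finitely many terms, so the tail is truncated at a `horizon`, an assumption of this example. For this particular sequence the truncated values agree with the true $\ell_n$, $L_n$ as soon as the horizon exceeds $n$.

```python
from fractions import Fraction


def x(n):
    """x_n = (-1)^n (1 + 1/n) for n >= 1, as an exact fraction."""
    return (-1) ** n * (1 + Fraction(1, n))


def tail_inf_sup(n, horizon):
    """inf and sup of x_k over n <= k <= horizon (finite stand-in for k >= n)."""
    tail = [x(k) for k in range(n, horizon + 1)]
    return min(tail), max(tail)


for n in (1, 10, 100):
    lo, hi = tail_inf_sup(n, 1000)
    print(n, float(lo), float(hi))
```

The printed lower values increase towards $-1$ and the upper values decrease towards $1$, in agreement with the monotonicity of $\{\ell_n\}$ and $\{L_n\}$.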

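2.37 Proposition. Every sequence in $\mathbb{R}$ has an upper and a lower limit in $\overline{\mathbb{R}}$.

From the definition

\[ \liminf_{n\to\infty} x_n \le \limsup_{n\to\infty} x_n, \quad \text{and} \quad \liminf_{n\to\infty} x_n = -\limsup_{n\to\infty} (-x_n). \]

Going back to the proof of Cauchy's criterion, we also see

2.38 Proposition. Let $\{x_n\}$ be a sequence in $\mathbb{R}$. Then $x_n \to \ell \in \overline{\mathbb{R}}$ if and only if

\[ \liminf_{n\to\infty} x_n = \limsup_{n\to\infty} x_n = \ell. \]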

The following proposition characterizes the upper limit of a bounded 
sequence. 

2.39 Proposition. Let $\{x_n\}$ be a sequence in $\mathbb{R}$. The number $L \in \mathbb{R}$ is the upper limit of $\{x_n\}$ if and only if

(i) $\forall \epsilon > 0$ $\exists\, \bar n$ such that $x_n < L + \epsilon$ for all $n \ge \bar n$,

(ii) there exists a subsequence $\{x_{k_n}\}$ of $\{x_n\}$ that converges to $L$.

Proof. Let $L = \limsup_{n\to\infty} x_n \in \mathbb{R}$ and let us prove that (i) and (ii) hold. By definition $L = \lim_{n\to\infty} \sup_{k \ge n} \{x_k\}$, hence for $\epsilon > 0$ there is $n_\epsilon$ such that

\[ L - \epsilon < \sup_{k \ge n} \{x_k\} < L + \epsilon \quad \text{for } n \ge n_\epsilon. \]

Because of the properties of the supremum, the last inequalities imply

\[ x_n < L + \epsilon \quad \text{for } n \ge n_\epsilon, \tag{2.8} \]

\[ \text{for all } n \ge n_\epsilon \text{ there is } k \ge n \text{ such that } x_k > L - \epsilon. \tag{2.9} \]

Clearly (2.8) is (i). Let us exhibit a subsequence of $\{x_n\}$ converging to $L$.

For $\epsilon = 1$ we choose $n = n_\epsilon$, and, on account of (2.8) and (2.9), we find $k_1 \ge n_\epsilon$ such that $L - 1 < x_{k_1} < L + 1$. For $\epsilon = 1/2$ we choose $n = \max(k_1 + 1, n_\epsilon)$ and again by (2.8) and (2.9) we find $k_2 \ge k_1 + 1 > k_1$ with $k_2 \ge n_\epsilon$ such that

\[ L - \frac{1}{2} < x_{k_2} < L + \frac{1}{2}. \]

By induction we then find a subsequence $\{x_{k_n}\}$ of $\{x_n\}$ such that $|x_{k_n} - L| < \frac{1}{n}$ $\forall n \ge 1$, hence converging to $L$. This proves (ii).

Conversely, suppose that (i) and (ii) hold, and let $\epsilon > 0$. From (i) we infer that

\[ \sup_{k \ge n} \{x_k\} \le L + \epsilon \quad \forall n \ge \bar n \]

and, since there is a subsequence that converges to $L$,

\[ \sup_{k \ge n} \{x_k\} > L - \epsilon \quad \text{for all } n. \]

In conclusion there is $n_\epsilon$ such that

\[ \Big| \sup_{k \ge n} \{x_k\} - L \Big| \le \epsilon \quad \text{for } n \ge n_\epsilon, \]

that is, $\epsilon$ being arbitrary, $L$ is the upper limit of $\{x_n\}$. □

Similarly we have

2.40 Proposition. Let $\{x_n\}$ be a sequence of real numbers. The number $L \in \mathbb{R}$ is the lower limit of $\{x_n\}$ if and only if

(i) $\forall \epsilon > 0$ $\exists\, \bar n$ such that $x_n > L - \epsilon$ for $n \ge \bar n$,

(ii) there exists a subsequence $\{x_{k_n}\}$ of $\{x_n\}$ that converges to $L$.

Proof. We can give a direct proof following the scheme of the proof of Proposition 2.39, or derive the claim from Proposition 2.39, since

\[ \liminf_{n\to+\infty} x_n = -\limsup_{n\to+\infty} (-x_n). \]

□



2.41 ¶. Show the following

Proposition. Let $\{x_n\}$ be a sequence in $\mathbb{R}$.

(i) $+\infty$ is the upper limit of $\{x_n\}$ if and only if $\{x_n\}$ has a subsequence that diverges to $+\infty$.

(ii) $-\infty$ is the upper limit of $\{x_n\}$ if and only if $\{x_n\}$ diverges to $-\infty$.

(iii) $-\infty$ is the lower limit of $\{x_n\}$ if and only if $\{x_n\}$ has a subsequence that diverges to $-\infty$.

(iv) $+\infty$ is the lower limit of $\{x_n\}$ if and only if $\{x_n\}$ diverges to $+\infty$.

2.42 The limit values of a sequence $\{x_n\}$ are the limits of the convergent subsequences of $\{x_n\}$. Show that the upper limit (respectively the lower limit) is the supremum (respectively the infimum) of the set of the limit values.

Figure 2.7. Bernhard Bolzano (1781-1848) and the frontispiece of the work where the Bolzano-Weierstrass theorem appears.

d. Bolzano-Weierstrass theorem

Since every bounded sequence has a finite upper limit $L$ and a subsequence converging to $L$, see Proposition 2.39, we can state

2.43 Theorem (Bolzano-Weierstrass). Every bounded sequence of reals contains a convergent subsequence.



e. The continuity property of the reals 

We have seen that the continuity of reals implies 

(i) the Archimedean property, 

(ii) existence of the limit of monotone sequences, 

(iii) Cantor’s principle, 

(iv) Cauchy’s criterion, 

(v) existence of upper and lower limits, 

(vi) that every bounded sequence has a convergent subsequence. 
Actually we also have 

2.44 Proposition. Let $X$ be an ordered field. The following claims (i) and (ii) are equivalent.

(i) The continuity axiom (C),

(ii) The Archimedean principle and one among

a) Cantor's principle,

b) Cauchy's criterion,

c) existence of the limit of monotone sequences,

d) existence of the upper and lower limit of a sequence,

e) every bounded sequence has a convergent subsequence.

2.45 ¶¶. Show Proposition 2.44. [Hint: In order to show that every nonempty bounded set $A$ has a supremum if the Archimedean property and one of (a), (b), (c), (d) or (e) hold, it is convenient to take into account the following construction. Let $A \subset \mathbb{R}$ be nonempty and bounded. Choose $a_0 \in A$, an upper bound $b_0$ of $A$, and set $c = (a_0 + b_0)/2$. If $c$ is an upper bound of $A$, we set $a_1 := a_0$, $b_1 := c$; otherwise we can find $d \in A$ with $d > c \ge a_0$ and, in this case, we set $a_1 := d$ and $b_1 := b_0$. If we repeat the argument starting from $a_1$ and $b_1$ instead of $a_0$, $b_0$, and continue this way, we construct two sequences $\{a_n\}$ and $\{b_n\}$ such that for all $n$, $a_n \in A$, $b_n$ is an upper bound of $A$, $a_n \le b_m$ for all $m$, $\{a_n\}$ is increasing, $\{b_n\}$ is decreasing, and

\[ b_n - a_n \le (b_0 - a_0)/2^n. \]

Using this construction and one of (a), (b), (c), (d) or (e) it is not difficult to show the existence of the supremum of $A$: one needs the Archimedean property to show that the limits of $\{a_n\}$ and $\{b_n\}$ are equal.]



2.3 Limits of Sequences and 
Continuity 

a. Limits of sequences and limits of functions 

The definition of limit of a sequence in Section 2.1 and that of limit of a function can be reduced one to the other.

2.46 Theorem. Let $f : ]a, b[ \to \mathbb{R}$ be a function and $x_0 \in [a, b]$. The following two claims are equivalent:

(i) $f(x) \to L \in \overline{\mathbb{R}}$ as $x \to x_0$, $x \in ]a, b[$,

(ii) for any sequence $\{x_n\} \subset ]a, b[ \setminus \{x_0\}$ with $x_n \to x_0$ we have $f(x_n) \to L$.

Proof. We prove the theorem in the case $L \in \mathbb{R}$ and leave the proof to the reader in the other cases.

(i) $\Rightarrow$ (ii) Let $\epsilon > 0$. By assumption

\[ \exists\, \delta > 0 : \text{ if } x \in ]a, b[,\ x \ne x_0 \text{ and } |x - x_0| < \delta, \text{ then } |f(x) - L| < \epsilon. \tag{2.10} \]

If $\{x_n\}$ converges toward $x_0$, then there is an index $\nu$ such that $|x_n - x_0| < \delta$ for all $n \ge \nu$; since $x_n \ne x_0$ and $x_n \in ]a, b[$, (2.10) yields $|f(x_n) - L| < \epsilon$ for all $n \ge \nu$, that is $f(x_n) \to L$, $\epsilon$ being arbitrary.

(ii) $\Rightarrow$ (i) Assume that $f(x)$ does not tend to $L$ when $x \to x_0$. Then there exist $\epsilon_0 > 0$ and, for any given $\delta > 0$, a point $x \in ]a, b[ \setminus \{x_0\}$ such that $|x - x_0| < \delta$ while $|f(x) - L| \ge \epsilon_0$. Choosing $\delta = 1, 1/2, 1/3, \dots$ we define this way a sequence $\{x_n\}$ with values in $]a, b[ \setminus \{x_0\}$ such that

\[ |x_n - x_0| < 1/n \quad \text{and} \quad |f(x_n) - L| \ge \epsilon_0. \]

In particular $x_n \to x_0$ and $f(x_n)$ does not converge to $L$: a contradiction.

□







2.47 Example. Consider the sequence $x_n := \sqrt{n}\,\sin(1/\sqrt{n})$. Since $f(x) = \sin x / x \to 1$ as $x \to 0^+$ and $x_n = f(1/\sqrt{n})$, Theorem 2.46 yields $x_n \to 1$.

2.48 Example. Let $f, g : ]a, b[ \to \mathbb{R}$ be two functions, $x_0 \in [a, b]$, and $f(x) \to L$, $g(x) \to M$ as $x \to x_0$, $x \in ]a, b[$. We may prove that $f(x) + g(x) \to L + M$ as $x \to x_0$ as a consequence of Proposition 2.20. For any sequence $\{x_n\} \subset ]a, b[ \setminus \{x_0\}$ which converges to $x_0$, Theorem 2.46 yields $f(x_n) \to L$, $g(x_n) \to M$, hence $f(x_n) + g(x_n) \to L + M$, according to Proposition 2.20. Again Theorem 2.46 then allows us to conclude that

\[ f(x) + g(x) \to L + M \quad \text{as } x \to x_0,\ x \in ]a, b[. \]

2.49 Example. Let us give another example by proving the change of variable rule, Proposition 2.27 of [GM1].

Proposition. Let $f : I \to \mathbb{R}$ be a function defined on an interval $I$, let $x_0$ be a point in $I$ or one of its extremal points, and let $f(x) \to L$, $L \in \overline{\mathbb{R}}$, as $x \to x_0$. Let $x(t) : J \to I$ be a function defined in an interval $J$ onto $I$ such that $x(t) \to x_0$ as $t \to t_0$, $t_0$ being a point in $J$ or one of its extremal points. If one of the following two conditions holds:

(i) $x_0 \in I$ and $f$ is continuous at $x_0$,

(ii) $x(t)$ never takes the value $x_0$ for $t \ne t_0$,

then $f(x(t)) \to L$ as $t \to t_0$, $t \in J$.

Proof. Let $\{t_n\}$ be a sequence with values in $J \setminus \{t_0\}$ that converges to $t_0$. Clearly $\{x(t_n)\} \subset I$ and, by Theorem 2.46, $x(t_n) \to x_0$. Let us prove that $f(x(t_n)) \to L$.

If $x$ never takes the value $x_0$ for $t \ne t_0$, then $\{x(t_n)\} \subset I \setminus \{x_0\}$ and therefore $f(x(t_n)) \to L$ by Theorem 2.46.

If $f(x_0) = L$, for the subsequence $\{s_n\}$ of $\{t_n\}$ such that $x(s_n) \ne x_0$ we have $f(x(s_n)) \to L$ on account of Theorem 2.46. Therefore for any $\epsilon > 0$ there exists $\bar n$ such that $|f(x(t_n)) - L| < \epsilon$ for any $n \ge \bar n$ such that $x(t_n) \ne x_0$. Since $f(x_0) = L$, $|f(x(t_n)) - L| < \epsilon$ for all $n \ge \bar n$. That is, $f(x(t_n)) \to L$, $\epsilon$ being arbitrary.

Finally, since $f(x(t_n)) \to L$ for any sequence $\{t_n\} \subset J \setminus \{t_0\}$ that converges to $t_0$, the claim follows applying once again Theorem 2.46. □



b. Continuity in terms of sequences 

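Let $f : [a, b] \to \mathbb{R}$ be a function and $x_0 \in [a, b]$. We recall, see Chapter 2 of [GM1], that $f$ is continuous at $x_0$ if $f(x) \to f(x_0)$ as $x \to x_0$, $x \in [a, b]$, or, in the $\epsilon$-$\delta$ language,

\[ \forall \epsilon > 0 \ \exists\, \delta > 0 : \text{ if } x \in [a, b] \text{ and } |x - x_0| < \delta, \text{ then } |f(x) - f(x_0)| < \epsilon. \]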



Theorem 2.46 yields at once 

2.50 Proposition. Let $f : [a, b] \to \mathbb{R}$. $f$ is continuous at $x_0 \in [a, b]$ if and only if for every sequence $\{x_n\} \subset [a, b]$ with $x_n \to x_0$ we have $f(x_n) \to f(x_0)$.

In terms of sequences we can also give proofs of the intermediate value theorem and of Weierstrass's theorem that are more robust, i.e., that can be extended to more general contexts.

2.51 Theorem. Let $f : [a, b] \to \mathbb{R}$ be a continuous function on $[a, b]$. If $f(a) < 0$ and $f(b) > 0$, then there exists $x_0 \in [a, b]$ such that $f(x_0) = 0$.

Proof. Set $a_0 := a$, $b_0 := b$ and $c := (a_0 + b_0)/2$. If $f(c) = 0$, then $c$ is the zero $x_0$; otherwise we set

\[ \begin{cases} a_1 := c,\ b_1 := b & \text{if } f(c) < 0, \\ a_1 := a,\ b_1 := c & \text{if } f(c) > 0. \end{cases} \]

The function $f$ is continuous on $[a_1, b_1] \subset [a_0, b_0]$, $f(a_1) < 0$ and $f(b_1) > 0$. Repeating the argument with $a_1, b_1$ instead of $a_0$ and $b_0$, and proceeding this way, we either find $c$ with $f(c) = 0$ after a finite number of steps, or we construct two sequences $\{a_n\}$ and $\{b_n\}$ with the following properties:

(i) $a_0 = a$, $b_0 = b$,

(ii) $\{a_n\}$ is increasing, $\{b_n\}$ is decreasing,

(iii) $f(a_n) < 0$ and $f(b_n) > 0$,

(iv) $|b_n - a_n| = \frac{1}{2} |b_{n-1} - a_{n-1}| = 2^{-n} |b_0 - a_0|$.

When $n \to \infty$, $a_n \uparrow x_0$, $b_n \downarrow y_0$, and $|y_0 - x_0| \le |b_n - a_n|$ $\forall n$. Since by (iv) $|b_n - a_n| \to 0$, we in fact have $x_0 = y_0$, and, $f$ being continuous, $f(a_n) \to f(x_0)$ and $f(b_n) \to f(x_0)$. On the other hand, by the constancy of sign, according to (iii) we infer $f(x_0) \le 0$ and $f(x_0) \ge 0$. We therefore conclude that we must have $f(x_0) = 0$. □
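The proof of Theorem 2.51 is constructive and translates almost literally into a bisection procedure. The following is a minimal sketch under stated assumptions: floating-point arithmetic replaces the reals, the loop stops once the interval is shorter than a tolerance, and the names `bisect_zero`, `tol` and `max_steps` are choices of this illustration rather than anything fixed by the text.

```python
def bisect_zero(f, a, b, tol=1e-12, max_steps=200):
    """Approximate a zero of a continuous f on [a, b], assuming f(a) < 0 < f(b)."""
    if not (f(a) < 0 < f(b)):
        raise ValueError("need f(a) < 0 < f(b)")
    for _ in range(max_steps):
        c = (a + b) / 2          # midpoint of the current interval [a_n, b_n]
        fc = f(c)
        if fc == 0:
            return c             # the zero was hit after finitely many steps
        if fc < 0:
            a = c                # keep f(a_n) < 0
        else:
            b = c                # keep f(b_n) > 0
        if b - a <= tol:         # |b_n - a_n| = 2^{-n} |b_0 - a_0| is small
            break
    return (a + b) / 2


root = bisect_zero(lambda t: t * t - 2.0, 0.0, 2.0)
print(root)
```

Each step halves the interval while preserving the sign conditions, which is property (iv) together with (iii) in the proof.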

2.52 Theorem (Weierstrass). Every continuous function $f : [a, b] \to \mathbb{R}$ on a closed and bounded interval attains its maximum and its minimum value.

Proof. Let us prove that $f$ attains its minimum. Define $E := f([a, b])$, let $L := \inf E$ and let $\{y_n\}$ be a minimizing sequence for $E$, that is, $\{y_n\} \subset E$ and $y_n \to L$. Since $E = f([a, b])$, there is also a sequence $\{x_n\} \subset [a, b]$ such that $f(x_n) = y_n$ $\forall n$. The sequence $\{x_n\}$ is clearly bounded and therefore, by the Bolzano-Weierstrass theorem, contains a subsequence $\{x_{k_n}\}$ that converges to some point $x_0 \in \mathbb{R}$. Actually, $[a, b]$ being a closed interval, $x_0 \in [a, b]$.

We shall now prove that $x_0$ is a minimizer for $f$. Since $\{x_{k_n}\}$ is a subsequence of $\{x_n\}$, on the one hand $f(x_{k_n}) \to L$; on the other hand $f(x_{k_n}) \to f(x_0)$, $f$ being continuous. Uniqueness of the limit yields $f(x_0) = L$, that is the claim. □



2.4 Some Special Sequences 

In this section we discuss a number of sequences that turn out to be quite relevant for the sequel.

Figure 2.8. John Wallis (1616-1703) and the frontispiece of his Opera Mathematica.



a. Elementary limits 

2.53 Example (Geometric sequence). Let $x_n := q^n$, $n \ge 0$, $q \in \mathbb{R}$. If $q = 1$, then trivially $q^n = 1$ $\forall n$ and $q^n \to 1$. If $q = -1$, then $q^n = (-1)^n$ has no limit.

Proposition. We have

(i) $q^n \to +\infty$ if $q > 1$,

(ii) $q^n \to 0$ if $|q| < 1$,

(iii) $q^n$ has no limit if $q < -1$.

Proof. Let $q > 1$. Since $q^x \to +\infty$ as $x \to +\infty$, $x \in \mathbb{R}$ (see, for example, [GM1]), we conclude that $q^n \to +\infty$ by Theorem 2.46.

For a direct proof, which makes no use of calculus, set $q = 1 + h$, $h > 0$; from Newton's binomial, Example 2.6, or from Bernoulli's inequality, Example 1.34, we have

\[ q^n = (1 + h)^n \ge n h, \]

thus $q^n \to +\infty$ by the comparison test (see, for example, Proposition 2.18).

If $0 < |q| < 1$, we write $|q| = 1/(1 + h)$ with $h > 0$ to find

\[ |q|^n = \frac{1}{(1 + h)^n} \le \frac{1}{n h} \to 0, \]

thus $q^n \to 0$, again by the comparison test; the case $q = 0$ is trivial. Finally if $q < -1$, the sequence has no limit since its two subsequences $q^{2n}$ and $q^{2n+1}$ have different limits,

\[ q^{2n} = |q|^{2n} \to +\infty, \qquad q^{2n+1} = -|q|^{2n+1} \to -\infty. \]

□

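2.54 Example. $\sqrt[n]{n} \to 1$. In fact, for $x \in \mathbb{R}$, $x > 0$, we can write $x^{1/x} = \exp(\log x / x)$. Since the exponential function is continuous and $\log x / x \to 0$ as $x \to +\infty$ (see, for example, [GM1]), we then conclude that

\[ x^{1/x} \to 1 \quad \text{as } x \to +\infty. \]

Consequently $\sqrt[n]{n} \to 1$.

We may also proceed as follows. We observe that $\sqrt[n]{n} \ge 1$, therefore $\sqrt[n]{n} =: 1 + h_n$ with $h_n \ge 0$, that is

\[ (1 + h_n)^n = n. \]

Using the binomial theorem, Example 2.6, we deduce for $n \ge 2$

\[ n = (1 + h_n)^n = 1 + n h_n + \frac{n(n-1)}{2} h_n^2 + \text{positive terms} \ge 1 + \frac{n(n-1)}{2} h_n^2, \]

which yields

\[ 0 \le h_n^2 \le \frac{2}{n}, \quad \text{i.e.,} \quad 0 \le h_n = \sqrt[n]{n} - 1 \le \sqrt{\frac{2}{n}}, \]

from which we get $\sqrt[n]{n} \to 1$ since $1/\sqrt{n} \to 0$.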
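The elementary estimate obtained from the binomial theorem, $0 \le \sqrt[n]{n} - 1 \le \sqrt{2/n}$, can be checked numerically. The sketch below uses floating-point arithmetic and a helper named `root_gap`, both assumptions of this illustration.

```python
def root_gap(n):
    """Return (h_n, bound) with h_n = n**(1/n) - 1 and bound = sqrt(2/n)."""
    return n ** (1.0 / n) - 1.0, (2.0 / n) ** 0.5


for n in (2, 10, 100, 10_000):
    h, bound = root_gap(n)
    print(n, h, bound)
```

The bound is far from sharp, since $h_n$ behaves like $\log n / n$, but it suffices for the comparison test.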

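2.55 Example. $\sqrt[n]{a} \to 1$ for all $a > 0$. If $a \ge 1$, we have

\[ 1 \le \sqrt[n]{a} \le \sqrt[n]{n} \quad \text{for } n \ge a. \]

Since $\sqrt[n]{n} \to 1$, the squeezing test yields $\sqrt[n]{a} \to 1$. If $0 < a < 1$ we write

\[ \lim_{n\to\infty} \sqrt[n]{a} = \frac{1}{\lim_{n\to\infty} \sqrt[n]{1/a}} = \frac{1}{1} = 1. \]

The claims in Examples 2.56 and 2.57 below are part of a series of results known as Cesàro theorems.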

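2.56 Example. We have

Proposition. If $a_n \to L$, then $\frac{1}{n} \sum_{i=1}^{n} a_i \to L$.

Proof. Assume $L \in \mathbb{R}$ and proceed similarly in the other cases. Given $\epsilon > 0$, we find $\bar n$ such that $|a_i - L| < \epsilon$ for all $i \ge \bar n$. On the other hand

\[ \frac{1}{n} \sum_{i=1}^{n} a_i - L = \frac{1}{n} \sum_{i=1}^{n} (a_i - L), \]

hence for $n \ge \bar n$,

\[ \Big| \frac{1}{n} \sum_{i=1}^{n} a_i - L \Big| \le \frac{1}{n} \sum_{i=1}^{n} |a_i - L| \le \frac{1}{n} \sum_{i=\bar n}^{n} |a_i - L| + \frac{1}{n} \sum_{i=1}^{\bar n} |a_i - L| \le \epsilon + \frac{1}{n} \sum_{i=1}^{\bar n} |a_i - L|, \]

thus we conclude that $\big| \frac{1}{n} \sum_{i=1}^{n} a_i - L \big| < 2\epsilon$ for $n$ sufficiently large. □

Notice that $\big\{ \frac{1}{n} \sum_{i=1}^{n} a_i \big\}$ may have a limit while $\{a_n\}$ has no limit, as the sequence $\{(-1)^n\}$ shows.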
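The remark about $(-1)^n$ can be made concrete. The sketch below computes arithmetic means exactly with rational numbers; the function name `cesaro_means` and the sample sequences are assumptions of this illustration.

```python
from fractions import Fraction


def cesaro_means(a, n_max):
    """Return [(1/n) * (a(1) + ... + a(n)) for n = 1..n_max] as exact fractions."""
    means, total = [], Fraction(0)
    for n in range(1, n_max + 1):
        total += a(n)
        means.append(total / n)
    return means


# a_n = (-1)^n has no limit, yet its arithmetic means tend to 0.
alternating = cesaro_means(lambda i: (-1) ** i, 1001)
print(alternating[999], alternating[1000])

# a_n = 1 + 1/n tends to 1, and so do its means (slowly).
print(float(cesaro_means(lambda i: 1 + Fraction(1, i), 1000)[-1]))
```

For the alternating sequence the partial sums are $-1$ or $0$, so the means are $-1/n$ or $0$.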



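2.57 Example. We have

Proposition. Let $\{a_n\}$ be a sequence of positive numbers and let $L \in \overline{\mathbb{R}}$. If

\[ \frac{a_{n+1}}{a_n} \to L, \]

then $\sqrt[n]{a_n} \to L$.

Proof. Suppose $0 < L < \infty$; the cases $L = 0$ and $L = +\infty$ are simpler, and we leave them to the reader. Set

\[ b_n := \log \frac{a_{n+1}}{a_n} \]

and observe that $b_n \to \log L$ and that

\[ \frac{1}{n} \log(a_n) = \frac{1}{n} \Big( \log a_0 + (\log a_1 - \log a_0) + (\log a_2 - \log a_1) + \cdots + (\log a_n - \log a_{n-1}) \Big) = \frac{1}{n} \log a_0 + \frac{1}{n} \sum_{i=0}^{n-1} b_i. \]

Therefore $\frac{1}{n} \log(a_n) \to 0 + \log L$, on account of Example 2.56. This proves the claim.

Let us give a more direct proof which makes no use of logarithms. Let $0 < \epsilon < L$. Then we can find $\bar n$ such that for all $n \ge \bar n$,

\[ (L - \epsilon)\, a_n < a_{n+1} < (L + \epsilon)\, a_n, \]

and, by iteration,

\[ \frac{a_{\bar n}}{(L - \epsilon)^{\bar n}}\, (L - \epsilon)^n < a_n < \frac{a_{\bar n}}{(L + \epsilon)^{\bar n}}\, (L + \epsilon)^n, \]

that is,

\[ (L - \epsilon) \sqrt[n]{B} < \sqrt[n]{a_n} < (L + \epsilon) \sqrt[n]{C} \quad \text{for } n > \bar n, \]

where

\[ B := \frac{a_{\bar n}}{(L - \epsilon)^{\bar n}} \quad \text{and} \quad C := \frac{a_{\bar n}}{(L + \epsilon)^{\bar n}} \]

depend on $\epsilon$, but not on $n$. Since $\sqrt[n]{B} \to 1$ and $\sqrt[n]{C} \to 1$, there is $n_1$ such that

\[ L - 2\epsilon < (L - \epsilon) \sqrt[n]{B}, \quad \text{and} \quad (L + \epsilon) \sqrt[n]{C} < L + 2\epsilon \]

for $n \ge n_1$. We therefore conclude for $n > \max(\bar n, n_1)$,

\[ L - 2\epsilon < \sqrt[n]{a_n} < L + 2\epsilon, \]

which yields the claim, since $\epsilon$ is arbitrary.

□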
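As an illustration of the proposition in Example 2.57, one may take the central binomial coefficients $a_n = \binom{2n}{n}$, an example chosen here and not taken from the text: the quotient $a_{n+1}/a_n = 2(2n+1)/(n+1)$ tends to $4$, so the $n$-th roots must tend to $4$ as well. The sketch below (helper name `ratio_and_root` is hypothetical) compares both quantities numerically; the root is computed through logarithms to avoid overflowing floats.

```python
import math


def ratio_and_root(n):
    """Return (a_{n+1}/a_n, a_n**(1/n)) for a_n = C(2n, n)."""
    a_n = math.comb(2 * n, n)
    a_next = math.comb(2 * n + 2, n + 1)
    ratio = a_next / a_n                    # exact integers, one final division
    root = math.exp(math.log(a_n) / n)      # n-th root via the logarithm
    return ratio, root


for n in (10, 100, 2000):
    print(n, ratio_and_root(n))
```

The root converges visibly more slowly than the quotient, which is typical: the converse of the proposition fails in general, and the root carries less information per term.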



2.58 Example. Let $x_n$ be the quotient of two polynomials in $n$,

\[ x_n = \frac{a_p n^p + a_{p-1} n^{p-1} + \cdots + a_1 n + a_0}{b_q n^q + b_{q-1} n^{q-1} + \cdots + b_1 n + b_0}, \]

$p$ and $q$ being the degrees, that is, $a_p, b_q \ne 0$. Of course numerator and denominator diverge, but by factorizing $n^p$ in the numerator and $n^q$ in the denominator we find

\[ x_n = n^{p-q}\, \frac{a_p + a_{p-1} \frac{1}{n} + \cdots + a_1 \frac{1}{n^{p-1}} + a_0 \frac{1}{n^p}}{b_q + b_{q-1} \frac{1}{n} + \cdots + b_1 \frac{1}{n^{q-1}} + b_0 \frac{1}{n^q}}. \]

The second factor, the large fraction on the right, tends to $a_p / b_q$. We then conclude that

\[ x_n \to \begin{cases} +\infty \cdot \operatorname{sgn}(a_p / b_q) & \text{if } p > q, \\ a_p / b_q & \text{if } p = q, \\ 0 & \text{if } p < q. \end{cases} \]

b. Powers, exponentials and factorials

2.59 Example (Exponentials grow faster than powers). Let $x_n := n^k / q^n$ with $k \in \mathbb{N}$ and $q > 1$. We claim that

\[ \frac{n^k}{q^n} \to 0. \]

For that, write $q = 1 + h$ with $h > 0$ and apply the binomial theorem with $n \ge k + 1$ to find

\[ q^n = (1 + h)^n = \binom{n}{k+1} h^{k+1} + \text{positive terms} \ge \frac{n(n-1)(n-2) \cdots (n-k)}{(k+1)!}\, h^{k+1}, \]

that is

\[ 0 \le \frac{n^k}{q^n} \le \frac{(k+1)!\, n^k}{h^{k+1}\, n(n-1)(n-2) \cdots (n-k)}. \]

This yields the claim by comparison, since the right-hand side is the quotient of two polynomials respectively of degree $k$ and $k + 1$, and thus tends to zero, according to Example 2.58.

2.60 Example (Factorials grow faster than powers). $x_n := n^k / n! \to 0$. For that observe that $n! = n(n-1)(n-2)(n-3) \cdots 3 \cdot 2 \cdot 1 \ge n(n-1)(n-2) \cdots (n-k)$ if $n \ge k + 1$. The comparison test and Example 2.59 then yield

\[ 0 \le \frac{n^k}{n!} \le \frac{n^k}{n(n-1) \cdots (n-k)} \to 0. \]

2.61 Example. $x_n := n!/n^n \to 0$. It suffices to observe that

\[ \frac{n!}{n^n} = \frac{n}{n}\, \frac{n-1}{n} \cdots \frac{1}{n} \le \frac{1}{n}. \]

2.62 Example (Factorials grow faster than exponentials). $x_n := q^n / n! \to 0$ for $q > 0$. This is trivial if $0 < q \le 1$. If $q > 1$, we observe that

\[ \frac{q^n}{n!} = \frac{q}{n}\, \frac{q}{n-1} \cdots \frac{q}{1} \]

and that all factors $q/n$ with $n > q$ are smaller than 1. Thus, for $n > [q]$,

\[ 0 < \frac{q^n}{n!} = \frac{q}{n} \cdots \frac{q}{[q]}\, \frac{q}{[q] - 1} \cdots \frac{q}{1} \le \frac{q}{n}\, q^{[q]}, \]

where $[q]$ denotes the integer part of $q$, and this yields the result trivially.
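The estimate $q^n/n! \le (q/n)\, q^{[q]}$ for $n > [q]$, which is how the inequality in Example 2.62 is read here, can be verified exactly with rational arithmetic. The names `exp_over_factorial` and `bound` below are assumptions of this sketch.

```python
from fractions import Fraction
from math import factorial, floor


def exp_over_factorial(q, n):
    """q^n / n! as an exact fraction."""
    return Fraction(q) ** n / factorial(n)


def bound(q, n):
    """The comparison term (q/n) * q^[q], valid for n > [q]."""
    q = Fraction(q)
    return (q / n) * q ** floor(q)


q = Fraction(7, 2)  # so [q] = 3
for n in (4, 10, 30):
    print(n, float(exp_over_factorial(q, n)), float(bound(q, n)))
```

The bound decays only like $1/n$, while $q^n/n!$ itself decays much faster; the crude bound is nevertheless enough for the comparison test.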

2.63 Example (Euler's number). For any $x \in \mathbb{R}$ we have

\[ \lim_{n\to\infty} \Big( 1 + \frac{x}{n} \Big)^n = e^x. \tag{2.11} \]

The claim is trivial if $x = 0$. For $x \ne 0$, recall that $D e^x = e^x$ (see, for example, Section 4.3 of [GM1]), a claim which is equivalent to

\[ \lim_{y \to 0} \frac{e^y - 1}{y} = 1, \quad \text{or to} \quad \lim_{y \to 0} \frac{\log(1 + y)}{y} = 1, \]

compare 4.3 of [GM1]. By the change of variables $y = 1/t$, $t > 0$, this yields

\[ \lim_{t \to +\infty} t \log\Big( 1 + \frac{1}{t} \Big) = 1 \]

or

\[ \lim_{t \to +\infty} \Big( 1 + \frac{1}{t} \Big)^t = e, \tag{2.12} \]

consequently

\[ \lim_{t \to +\infty} \Big( 1 + \frac{x}{t} \Big)^t = \Big( \lim_{t \to +\infty} \Big( 1 + \frac{x}{t} \Big)^{t/x} \Big)^x = e^x. \]

Thus Theorem 2.46 yields (2.11).

From (2.11) we get in particular

\[ e = \lim_{n \to \infty} \Big( 1 + \frac{1}{n} \Big)^n \tag{2.13} \]

which in fact is trivially equivalent to (2.11). In Section 2.5 below we will deduce (2.12), and consequently (2.11), directly from (2.13) making no use of calculus.

2.64 Example (Compound interest). If $d$ is the rate of interest per cent and interest 
is capitalized every year, the accumulated amount of an original capital $x$ after $n$ 
years is given by 

$$x_0 = x, \qquad x_{n+1} = x_n(1+d),$$

which yields 

$$x_n = x(1+d)^n.$$

If the interest is capitalized $N$ times per year, then we have 

$$x_n = x\Bigl(1+\frac{d}{N}\Bigr)^{Nn}$$

and for a continuous capitalization, 

$$x_n = x\lim_{N\to\infty}\Bigl(1+\frac{d}{N}\Bigr)^{Nn} = x e^{dn},$$

according to Example 2.63. 
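The passage from yearly to continuous capitalization can be checked with a few lines of Python. This is only a sketch: the function names are hypothetical, and the rate `d` is assumed to be given as a fraction (0.05 for five per cent).

```python
import math


def compound(x, d, years, periods_per_year=1):
    """Capital after `years` years when interest is capitalized N times per year.

    Implements x * (1 + d/N)^(N * years); d is the yearly rate as a fraction.
    """
    n_periods = periods_per_year * years
    return x * (1 + d / periods_per_year) ** n_periods


def continuous(x, d, years):
    """Limit N -> infinity of `compound`: x * e^(d * years)."""
    return x * math.exp(d * years)


if __name__ == "__main__":
    # Illustrative values: capital 100, rate 5%, 10 years.
    for periods in (1, 12, 365, 10**6):
        print(periods, compound(100.0, 0.05, 10, periods))
    print("continuous", continuous(100.0, 0.05, 10))
```

The printed amounts increase with the number of capitalizations per year and approach the continuous value from below.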

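2.65 Example (Sum of the terms of a geometric progression). Let $q \in \mathbb{R}$. Let 
us compute the sum 

$$G_q(n) := \sum_{j=0}^{n} q^j = 1 + q + \dots + q^n.$$

If $q = 1$, then $G_1(n) = n + 1$. For $q \ne 1$ the following formula holds: 

$$G_q(n) = \frac{1-q^{n+1}}{1-q} \qquad \forall n \ge 0, \tag{2.14}$$

as is easily seen by multiplying both sides by $1 - q$: 

$$(1-q)G_q(n) = (1-q)\sum_{j=0}^{n} q^j = \sum_{j=0}^{n} q^j - \sum_{j=0}^{n} q^{j+1} = 1 + q + q^2 + \dots + q^n - q - q^2 - \dots - q^{n+1} = 1 - q^{n+1}.$$

Formula (2.14) yields 

$$\lim_{n\to\infty} G_q(n) \begin{cases} = +\infty & \text{if } q \ge 1,\\[2pt] = \dfrac{1}{1-q} & \text{if } |q| < 1,\\[2pt] \text{does not exist} & \text{if } q \le -1. \end{cases} \tag{2.15}$$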
Figure 2.9. The frontispiece of Arithmetica infinitorum by John Wallis (1616-1703). 

c. Wallis and Stirling formulas 

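2.66 Example (Wallis's formula). For $n = 0, 1, 2, \dots$, define $J_n$ by 

$$J_n := \int_0^{\pi/2} \sin^n x\,dx.$$

Since an integral of $\sin^n x$, $I_n(x)$, is given (see, for example, Example 4.34 of [GM1]) by 

$$\begin{cases} I_0(x) = x, \quad I_1(x) = -\cos x,\\[2pt] I_n(x) = \dfrac{n-1}{n}\,I_{n-2}(x) - \dfrac{1}{n}\sin^{n-1}x\cos x \qquad \forall n \ge 2, \end{cases}$$

we have $J_n = I_n(\pi/2) - I_n(0)$, i.e., 

$$J_0 = \frac{\pi}{2}, \qquad J_1 = 1, \qquad J_n = \frac{n-1}{n}\,J_{n-2} \quad \forall n \ge 2.$$

Consequently 

$$J_{2n} = \frac{2n-1}{2n}\cdot\frac{2n-3}{2n-2}\cdots\frac{1}{2}\cdot\frac{\pi}{2} \quad\text{and}\quad J_{2n+1} = \frac{2n}{2n+1}\cdot\frac{2n-2}{2n-1}\cdots\frac{2}{3}\cdot 1, \tag{2.16}$$

which we can write, introducing the semifactorials 

$$(2n)!! := 2n(2n-2)\cdots 4\cdot 2, \qquad (2n+1)!! := (2n+1)(2n-1)\cdots 5\cdot 3\cdot 1,$$

as 

$$J_{2n} = \frac{(2n-1)!!}{(2n)!!}\,\frac{\pi}{2}, \qquad J_{2n+1} = \frac{(2n)!!}{(2n+1)!!}.$$

Since $\sin^{k+1} x \le \sin^k x$ on $[0, \pi/2]$ for all $k \ge 1$, we have $J_{k+1} \le J_k$ $\forall k$, hence 

$$1 \le \frac{J_{2n}}{J_{2n+1}} = \frac{2n+1}{2n}\,\frac{J_{2n}}{J_{2n-1}} \le \frac{2n+1}{2n} = 1 + \frac{1}{2n}.$$

Consequently $J_{2n}/J_{2n+1} \to 1$; this is Wallis's formula for $\pi$: 

$$\frac{\pi}{2} = \lim_{n\to\infty}\frac{[(2n)!!]^2}{(2n+1)!!\,(2n-1)!!} = \lim_{n\to\infty}\frac{2}{1}\cdot\frac{2}{3}\cdot\frac{4}{3}\cdot\frac{4}{5}\cdots\frac{2n}{2n-1}\cdot\frac{2n}{2n+1}. \tag{2.17}$$

Since $(2n)!! = 2^n n!$ and $(2n)!!\,(2n-1)!! = (2n)!$, we can rewrite (2.17) as 

$$\frac{\pi}{2} = \lim_{n\to\infty}\frac{2^{4n}(n!)^4}{[(2n)!]^2\,(2n+1)}$$

or equivalently 

$$\sqrt{\pi} = \lim_{n\to\infty}\frac{2^{2n}(n!)^2}{(2n)!\,\sqrt{n}}. \tag{2.18}$$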
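Wallis's product converges slowly, which a short computation makes visible. The sketch below is an illustration under stated assumptions: the function names are ours, and the second function evaluates the quotient in (2.18) through `math.lgamma` (logarithm of the Gamma function, with $\log n! = \mathrm{lgamma}(n+1)$) in order to avoid overflow for large $n$.

```python
import math


def wallis_partial_product(n):
    """Product of (2k/(2k-1)) * (2k/(2k+1)) for k = 1..n; tends to pi/2."""
    product = 1.0
    for k in range(1, n + 1):
        product *= (2 * k) / (2 * k - 1) * (2 * k) / (2 * k + 1)
    return product


def sqrt_pi_estimate(n):
    """2^(2n) (n!)^2 / ((2n)! sqrt(n)), computed via logarithms; tends to sqrt(pi)."""
    log_value = (2 * n * math.log(2) + 2 * math.lgamma(n + 1)
                 - math.lgamma(2 * n + 1) - 0.5 * math.log(n))
    return math.exp(log_value)


if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        print(n, 2 * wallis_partial_product(n), sqrt_pi_estimate(n) ** 2)
```

Both columns approach $\pi$, the first from below and the second from above, with an error of order $1/n$ in each case.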



2.67 Example (Stirling's formula). In many applications, notably in statistics, an 
estimate of the order of magnitude of $n!$ for $n$ large is useful. This estimate is provided 
by Stirling's formula 

$$\lim_{n\to\infty}\frac{n!}{n^n e^{-n}\sqrt{n}} = \sqrt{2\pi}$$

or, in terms of asymptotic expansions, 

$$n! \sim n^n e^{-n}\sqrt{2\pi n}.$$

In order to prove this, set 

$$a_n := \frac{n!}{n^n e^{-n}\sqrt{n}}, \qquad n \ge 1,$$

and observe that the real function 

$$f(t) := \Bigl(\frac{1}{t}+\frac{1}{2}\Bigr)\log(1+t), \qquad 0 < t \le 1,$$

is strictly increasing, $f(t) \to 1$ as $t \to 0^+$ and $f(t) < 1 + t^2/4$ since $\log(1+t) < 
t - t^2/2 + t^3/3$. Therefore 

$$1 < \frac{a_n}{a_{n+1}} = \frac{1}{e}\Bigl(1+\frac{1}{n}\Bigr)^{n+1/2} = \frac{1}{e}\exp\bigl(f(1/n)\bigr) < e^{1/(4n^2)}. \tag{2.19}$$

From (2.19) we see that $\{a_n\}$ is strictly decreasing, thus it converges to a limit $L$, $L \ge 0$, 
and that $L > 0$ since $a_n e^{-1/(4(n-1))}$, $n \ge 2$, is strictly increasing to $L$. In order to compute $L$, we 
observe that 

$$\frac{a_n^2}{a_{2n}\sqrt{2}} = \frac{(n!)^2\,2^{2n}}{(2n)!\,\sqrt{n}}.$$

Passing to the limit and taking into account Wallis's formula (2.18), we find 

$$\frac{L^2}{L\sqrt{2}} = \sqrt{\pi}, \quad\text{i.e.,}\quad L = \sqrt{2\pi}.$$

Notice that the limits 

$$\lim_{n\to\infty}\frac{n^n a^{-n}}{n!} = \begin{cases} 0 & \text{if } a > e,\\ +\infty & \text{if } a < e \end{cases}$$

are easy to obtain, instead. In fact, if $a_n := n^n a^{-n}/n!$, we have 

$$\frac{a_{n+1}}{a_n} = \frac{(n+1)^{n+1}a^{-(n+1)}/(n+1)!}{n^n a^{-n}/n!} = \frac{1}{a}\Bigl(1+\frac{1}{n}\Bigr)^n \to \frac{e}{a}.$$

This implies that $\{a_n\}$ grows or decays exponentially fast if $a < e$ or $a > e$ respectively. 
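The behaviour of the quotient $n!/(n^n e^{-n}\sqrt{n})$ appearing in Stirling's formula can be inspected numerically. The following sketch is an illustration only: the function name is hypothetical, and the quotient is evaluated through logarithms with `math.lgamma` (so that $\log n! = \mathrm{lgamma}(n+1)$), which is a choice of ours to avoid overflow.

```python
import math


def stirling_ratio(n):
    """n! / (n^n e^(-n) sqrt(n)), evaluated as exp of a sum of logarithms."""
    log_ratio = math.lgamma(n + 1) - n * math.log(n) + n - 0.5 * math.log(n)
    return math.exp(log_ratio)


if __name__ == "__main__":
    print("sqrt(2*pi) =", math.sqrt(2 * math.pi))
    for n in (1, 2, 10, 100, 1000, 10**6):
        print(n, stirling_ratio(n))
```

The printed values decrease towards $\sqrt{2\pi} \approx 2.5066$, in agreement with the monotonicity proved in Example 2.67.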
d. Numerical integration 

Let $f : [a, b] \to \mathbb{R}$ be a Riemann integrable function, and $a = x_0 < x_1 < 
\dots < x_n = b$ be a subdivision of $[a, b]$. Then the integral of $f$, see, e.g., 
Section 3.2.1 of [GM1], is the limit of the sums 

$$\sum_{i=0}^{n-1} (x_{i+1} - x_i)\,f(\xi_i)$$

when the length $\sup_i |x_{i+1} - x_i|$ of the partition $a = x_0 < x_1 < \dots < x_n = b$ 
tends to zero, $\xi_i$ being any point chosen in $[x_i, x_{i+1}]$. 

This allows us to find approximate formulas for the integral of $f$. 



2.68 The rectangle rule. Divide $[a, b]$ into $N$ equal parts by means of the subdivision 
$x_i = a + i\,\frac{b-a}{N}$, $i = 0, \dots, N$, and choose $\xi_i := (x_i + x_{i+1})/2$. Then we have 

$$\int_a^b f(x)\,dx = \lim_{N\to\infty}\frac{b-a}{N}\sum_{i=0}^{N-1} f(\xi_i).$$

Assuming that $f$ is regular, it is possible to estimate the error 

$$E(h) := \int_a^b f(x)\,dx - h\sum_{i=0}^{N-1} f(\xi_i)$$

in terms of $h := (b - a)/N$. Suppose for instance that $f \in C^1([a, b])$; then 

$$|E(h)| \le \sum_{i=0}^{N-1}\int_{x_i}^{x_{i+1}} |f(x) - f(\xi_i)|\,dx \le \frac{1}{2}\sum_{i=0}^{N-1}\sup_{]a,b[}|f'|\,(x_{i+1}-x_i)^2 = \frac{(b-a)^2}{2N}\sup_{]a,b[}|f'| = \frac{b-a}{2}\,\sup_{]a,b[}|f'|\;h.$$



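2.69 The trapezoid formula. Here we choose as approximate value of the integral 
the integral of the piecewise linear interpolant $\hat f$ of $f$ at the points $x_i := a + hi$, that is of 

$$\hat f(x) := f(x_i) + \frac{f(x_{i+1}) - f(x_i)}{h}\,(x - x_i), \qquad x_i := a + hi, \quad x_i \le x \le x_{i+1}.$$

Therefore 

$$\int_a^b \hat f(x)\,dx = \frac{b-a}{N}\sum_{i=0}^{N-1}\frac{f(x_i) + f(x_{i+1})}{2}.$$

If $f$ is of class $C^2$, by integrating by parts twice and using that $\hat f(x_i) = f(x_i)$ $\forall i$, 
we find 

$$\int_{x_i}^{x_{i+1}} (f - \hat f)\,dx = \frac{1}{2}\int_{x_i}^{x_{i+1}} (x - x_i)(x - x_{i+1})\,f''(x)\,dx,$$

hence, being as above $h := (b - a)/N$, 

$$\Bigl|\int_{x_i}^{x_{i+1}} (f - \hat f)\,dx\Bigr| \le \frac{1}{2}\sup_{]a,b[}|f''|\int_{x_i}^{x_{i+1}} (x - x_i)(x_{i+1} - x)\,dx = \frac{h^3}{12}\sup_{]a,b[}|f''|.$$

In conclusion we have 

$$\Bigl|\int_a^b f\,dx - \int_a^b \hat f\,dx\Bigr| \le \sup_{]a,b[}|f''|\;\frac{(b-a)^3}{12N^2}$$

or 

$$\int_a^b f(x)\,dx = \frac{b-a}{N}\sum_{i=0}^{N-1}\frac{f(x_i) + f(x_{i+1})}{2} + O\Bigl(\frac{1}{N^2}\Bigr).$$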

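2.70 Simpson's rule. Instead of interpolating with a piecewise linear function, we 
want to interpolate with a quadratic function. Let $f : [-1, 1] \to \mathbb{R}$; we look for $\hat f(t) = 
At^2 + Bt + C$, $t \in [-1, 1]$, in such a way that $\hat f(-1) = f(-1)$, $\hat f(0) = f(0)$, $\hat f(1) = f(1)$. 
We easily find 

$$A = \frac{f(1) + f(-1) - 2f(0)}{2}, \qquad B = \frac{f(1) - f(-1)}{2}, \qquad C = f(0),$$

consequently 

$$\int_{-1}^{1} \hat f\,dt = \frac{1}{3}\bigl(f(-1) + 4f(0) + f(1)\bigr).$$

Set now $x_k := a + kh$, $h := (b-a)/(2n)$, $k = 0, \dots, 2n$, so that the number $N = 2n + 1$ of nodes is odd. From the above we 
then infer (by changing variable) 

$$\int_{x_0}^{x_2} \hat f\,dx = \frac{h}{3}\bigl(f(x_0) + 4f(x_1) + f(x_2)\bigr),$$

$$\int_{x_2}^{x_4} \hat f\,dx = \frac{h}{3}\bigl(f(x_2) + 4f(x_3) + f(x_4)\bigr),$$

$$\dots$$

$$\int_{x_{2n-2}}^{x_{2n}} \hat f\,dx = \frac{h}{3}\bigl(f(x_{2n-2}) + 4f(x_{2n-1}) + f(x_{2n})\bigr),$$

that is 

$$\int_a^b \hat f\,dx = \frac{h}{3}\Bigl(f(a) + 2\sum_{k=1}^{n-1} f(x_{2k}) + 4\sum_{k=0}^{n-1} f(x_{2k+1}) + f(b)\Bigr).$$

In conclusion we find 

$$\int_a^b f\,dx = \int_a^b \hat f\,dx + E(h),$$

known as Simpson's rule. One can show that, if $f \in C^4([a, b])$, then 

$$|E(h)| \le \frac{b-a}{180}\,h^4 \sup_{]a,b[}|f^{(4)}|; \tag{2.20}$$

in particular $E(h) = O(h^4)$ as $h \to 0$. 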
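The three quadrature rules of 2.68-2.70 can be compared on a test integral. The sketch below is illustrative: the function names and the choice of $\int_0^\pi \sin x\,dx = 2$ as test case are assumptions of ours. In `simpson_rule` the argument `n` is the number of parabolic arcs, so that the step is $h = (b-a)/(2n)$ as in 2.70.

```python
import math


def midpoint_rule(f, a, b, n):
    """Rectangle rule with n subintervals, evaluating f at the midpoints."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


def trapezoid_rule(f, a, b, n):
    """Integral of the piecewise linear interpolant on n subintervals."""
    h = (b - a) / n
    return h * sum((f(a + i * h) + f(a + (i + 1) * h)) / 2 for i in range(n))


def simpson_rule(f, a, b, n):
    """Composite Simpson rule on the nodes x_k = a + k*h, k = 0..2n."""
    h = (b - a) / (2 * n)
    x = [a + k * h for k in range(2 * n + 1)]
    even = sum(f(x[2 * k]) for k in range(1, n))      # interior even nodes
    odd = sum(f(x[2 * k + 1]) for k in range(n))      # odd nodes
    return h / 3 * (f(x[0]) + 2 * even + 4 * odd + f(x[2 * n]))


if __name__ == "__main__":
    exact = 2.0  # integral of sin over [0, pi]
    for n in (4, 8, 16, 32):
        print(n,
              abs(midpoint_rule(math.sin, 0.0, math.pi, n) - exact),
              abs(trapezoid_rule(math.sin, 0.0, math.pi, n) - exact),
              abs(simpson_rule(math.sin, 0.0, math.pi, n) - exact))
```

When `n` doubles, the trapezoid error drops by a factor of about 4 and the Simpson error by about 16, as the bounds $O(1/N^2)$ and $O(h^4)$ suggest. The midpoint error also drops by about 4 on this smooth integrand, which is better than the first-order bound proved in 2.68 under the weaker assumption $f \in C^1$.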

2.71 ¶¶. Prove (2.20). [Hint: Set 

$$R(h) := \int_{x-h}^{x+h} f(t)\,dt - \frac{h}{3}\bigl(f(x-h) + 4f(x) + f(x+h)\bigr).$$

Show that $R'''(h) = -\frac{2h^2}{3}\,f^{(4)}(\xi)$ for some $\xi \in\, ]x - h, x + h[$. From $R(0) = R'(0) = 
R''(0) = 0$ infer then that $R(h) = -\frac{h^5}{90}\,f^{(4)}(\xi)$ for some $\xi \in [x - h, x + h]$, and hence 
the result.] 



2.5 An Alternative Definition of 
Exponentials and Logarithms 

In [GMl] we defined Euler’s number e, the exponential and logarithm 
functions by making use of the differential and integral calculus. For the 
sake of completeness we give in this section a more direct, though slightly 
more involved, definition, which makes use of the calculus of limits and 
of continuous functions. The procedure is as follows: we define $a^x$ for $x$ 
rational, we then extend “by continuity” $a^x : \mathbb{Q} \to \mathbb{R}$ to a continuous 
function from $\mathbb{R}$ to $\mathbb{R}$. Then we define the logarithm with base $a$ as the 
inverse of $a^x$, which turns out to be continuous because of 2.48 of [GM1]. 



a. A definition of $a^x$ using continuity 

2.72 Rational powers. Let $a > 1$. Taking into account the existence of 
the $n$-th root of a real number, we define $a^r$ when $r$ is rational by 

$$a^r := \sqrt[q]{a^p} \qquad \text{if } r = p/q \in \mathbb{Q}; \tag{2.21}$$

in fact, it is not difficult to show that the result of the operation $\sqrt[q]{a^p}$ 
depends only on the quotient $p/q$ and not on $p$ and $q$ (if $p/q = m/n$, then 
$\sqrt[q]{a^p} = \sqrt[n]{a^m}$). Thus $a^{p/q} = (\sqrt[q]{a})^p$. When $0 < a < 1$, we set, still for 
$r \in \mathbb{Q}$, 

$$a^r := \Bigl(\frac{1}{a}\Bigr)^{-r},$$

and, if $a = 1$, we set $a^r = 1^r = 1$ $\forall r \in \mathbb{Q}$. This way $a^r$ is defined for $a > 0$ 
and $r \in \mathbb{Q}$. 
The computational rules for the exponentials of rational numbers are 
now an easy consequence: if $x, y \in \mathbb{R}$, $x, y > 0$ and $r, s \in \mathbb{Q}$, we have 

$$\begin{aligned} &x^r y^r = (xy)^r, \qquad x^r x^s = x^{r+s}, \qquad (x^r)^s = x^{rs},\\ &\text{if } 0 < x < y \text{ and } r > 0, \text{ then } x^r < y^r,\\ &\text{if } 0 < x < 1 \text{ and } r < s, \text{ then } x^s < x^r,\\ &\text{if } x > 1 \text{ and } r < s, \text{ then } x^r < x^s. \end{aligned} \tag{2.22}$$




2.73 Real powers. The idea is to define $a^x$ when $a \in \mathbb{R}$, $a > 0$ and 
$x \in \mathbb{R}$ as the limit of $a^{x_n}$ where $\{x_n\}$ is a sequence of rational numbers 
converging to $x$. For that we need, and in fact it suffices, to show that: 

(i) the sequence $\{a^{x_n}\}$ converges if $x_n \to x$, 

(ii) the limit of $\{a^{x_n}\}$ does not depend on $\{x_n\}$ itself but only on the limit 
$x$. 

Finally, we need to show that the new function agrees with the old one 
if $x$ is rational. 

Suppose $a > 1$. Choosing $b := a^{1/n} - 1$ in Bernoulli's inequality, 

$$(1 + b)^n \ge 1 + nb, \qquad \forall n \in \mathbb{N},\ \forall b \ge -1,$$

we deduce 

$$0 < a^{1/n} - 1 \le \frac{a-1}{n}. \tag{2.23}$$

Also, it is easily seen that 

$$|a^r - 1| \le a^{|r|} - 1, \qquad \forall r \in \mathbb{Q}, \tag{2.24}$$

since $a > 1$. The inequalities (2.23) and (2.24) then yield the following. 

2.74 Lemma. If $\{r_n\}$ is a sequence of rationals with $r_n \to 0$, then $a^{r_n} \to 1$. 

2.75 Proposition. For any sequence $\{x_n\} \subset \mathbb{Q}$ with $x_n \to x$, the sequence 
$\{a^{x_n}\}$ has a limit that depends only on $x$. 

Proof. If $\{x_n\}$ is an increasing sequence that converges to $x$, then $\{a^{x_n}\}$ is increasing 
and bounded, therefore has a limit, $L$. Consider now any sequence of rationals 
$\{y_n\}$ so that $y_n \to x$, then 

$$|a^{y_n} - L| \le |a^{y_n} - a^{x_n}| + |a^{x_n} - L| = a^{x_n}\,|a^{y_n - x_n} - 1| + |a^{x_n} - L|.$$

Since $a^{x_n} \to L$ and $y_n - x_n \to 0$, Lemma 2.74 and the comparison test yield $a^{y_n} \to L$. 

□ 
Because of Proposition 2.75, the function 

$$A(x) := \lim_{n\to\infty} a^{x_n}, \qquad \{x_n\} \subset \mathbb{Q},\ x_n \to x, \tag{2.25}$$

is well defined for any $x$; moreover $A(x) = a^x$ for any $x \in \mathbb{Q}$: in fact, if we 
choose $x_n := x$ $\forall n$, then 

$$A(x) = \text{(by (2.25))} = \lim_{n\to\infty} a^{x_n} = a^x \quad \text{(by (2.21))}.$$

Therefore $A(x)$ extends to the reals the function $a^x$, $x \in \mathbb{Q}$, and will be 
denoted again by $a^x$, $x \in \mathbb{R}$. 

Then one defines $a^x$ when $0 < a < 1$ by 

$$a^x := \Bigl(\frac{1}{a}\Bigr)^{-x}$$

and $a^x = 1^x := 1$ if $a = 1$. 



2.76 Real powers: laws of exponents. If we take into account how 
the operation of limit behaves with respect to the algebraic operations 
and the order of $\mathbb{R}$, it is easy to extend the claims (2.22) and (2.24) to the 
case in which $r, s \in \mathbb{R}$. 



2.77 Show that (2.22) and (2.24) hold for $r, s \in \mathbb{R}$. 



2.78 Continuity of the exponential. The argument in Proposition 
2.75 shows also the following. 



2.79 Proposition. If $\{x_n\} \subset \mathbb{R}$, $x_n \to x$, then $a^{x_n} \to a^x$. 

Theorem 2.46 then implies that $a^x$ is a continuous function in $\mathbb{R}$. Since for 
$a > 1$, $a^n \to +\infty$ as $n \to \infty$ and $a^x$, $x \in \mathbb{R}$, is monotonically increasing, 
we also infer 

$$\inf_{x\in\mathbb{R}} a^x = 0, \qquad \sup_{x\in\mathbb{R}} a^x = +\infty, \qquad 0 < a \ne 1.$$

This way the exponential function is defined for all $a > 0$ and all $x \in \mathbb{R}$ 
and agrees with the exponential function defined in [GM1], since both are 
continuous and agree on rationals. 
b. Euler's number e 

Before proving the differentiability of the exponential function $a^x$, we need 
a definition of the Euler number $e$. 

2.80 Proposition. The sequence $x_n := (1 + 1/n)^n$ is increasing and 
bounded, $2 \le x_n < 3$. 

Proof. In fact 

$$\Bigl(1+\frac{1}{n}\Bigr)^n = \sum_{k=0}^{n}\binom{n}{k}\frac{1}{n^k} = \sum_{k=0}^{n}\frac{1}{k!}\,\frac{n(n-1)\cdots(n-k+1)}{n^k} = \sum_{k=0}^{n}\frac{1}{k!}\Bigl(1-\frac{1}{n}\Bigr)\Bigl(1-\frac{2}{n}\Bigr)\cdots\Bigl(1-\frac{k-1}{n}\Bigr). \tag{2.26}$$

Since each term in the sum increases with $n$ and moreover, when $n$ increases, new 
positive terms add to the sum, we infer that $\{x_n\}$ is strictly increasing and $x_n \ge x_1 = 2$ 
$\forall n$. Equality (2.26) yields also 

$$x_n \le \sum_{k=0}^{n}\frac{1}{k!}.$$

On the other hand $2^k \le k!$ for $k \ge 4$, therefore 

$$\sum_{j=4}^{n}\frac{1}{j!} \le \sum_{j=4}^{n} 2^{-j} = -1 - 1/2 - 1/4 - 1/8 + G_{1/2}(n) < -1 - 1/2 - 1/4 - 1/8 + 2 = 1/8,$$

if we take into account Example 2.65. We then conclude, for all $n \ge 4$, 

$$x_n \le 1 + 1 + \frac{1}{2} + \frac{1}{6} + \sum_{j=4}^{n}\frac{1}{j!} < 1 + 1 + \frac{1}{2} + \frac{1}{6} + \frac{1}{8} < 2.9.$$

□ 

The sequence $(1 + 1/n)^n$ has therefore a finite limit, as it is increasing 
and bounded. Set 

$$e := \lim_{n\to+\infty}\Bigl(1+\frac{1}{n}\Bigr)^n, \tag{2.27}$$

and notice that 

$$2 < e < 2.9.$$
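The statements of Proposition 2.80 can be verified exactly for small $n$ with rational arithmetic. This is a minimal sketch: the function names are hypothetical and the use of Python's `fractions.Fraction` is our choice, made so that monotonicity and the bounds are checked without rounding errors.

```python
from fractions import Fraction
from math import factorial


def euler_sequence(n):
    """x_n = (1 + 1/n)^n as an exact rational number."""
    return (1 + Fraction(1, n)) ** n


def partial_sum(n):
    """s_n = 1/0! + 1/1! + ... + 1/n!, the upper bound for x_n given by (2.26)."""
    return sum(Fraction(1, factorial(k)) for k in range(n + 1))


if __name__ == "__main__":
    for n in (1, 2, 5, 10, 50):
        print(n, float(euler_sequence(n)), float(partial_sum(n)))
```

The first column increases slowly towards $e$, while the sums $\sum 1/k!$ bound it from above and stay below 2.9.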



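c. Derivative of the exponential 

Finally, let us show directly that $Da^x = a^x\log_e a$. First we show that (2.27) 
yields 

2.81 Proposition. We have 

$$\lim_{x\to 0}(1 + x)^{1/x} = e. \tag{2.28}$$

Proof. Of course we do not want to differentiate. Therefore we denote by $[x]$ the integral 
part of $x$ and observe that 

$$\Bigl(1+\frac{1}{[x]+1}\Bigr)^{[x]} \le \Bigl(1+\frac{1}{x}\Bigr)^{x} \le \Bigl(1+\frac{1}{[x]}\Bigr)^{[x]+1}, \qquad x \ge 1, \tag{2.29}$$

and that 

$$\Bigl(1+\frac{1}{n}\Bigr)^{n+1} \to e, \quad\text{and}\quad \Bigl(1+\frac{1}{n+1}\Bigr)^{n} \to e.$$

Given $\epsilon > 0$ we then find $\bar n \in \mathbb{N}$ such that 

$$e - \epsilon \le \Bigl(1+\frac{1}{n+1}\Bigr)^{n}, \quad\text{and}\quad \Bigl(1+\frac{1}{n}\Bigr)^{n+1} \le e + \epsilon \quad\text{for } n \ge \bar n,$$

and, on account of (2.29), 

$$e - \epsilon \le \Bigl(1+\frac{1}{x}\Bigr)^{x} \le e + \epsilon \quad\text{for } x \ge \bar n.$$

$\epsilon$ being arbitrary, this yields 

$$\lim_{x\to+\infty}\Bigl(1+\frac{1}{x}\Bigr)^{x} = e. \tag{2.30}$$

On the other hand, if $x < 0$ and $y = -x$, 

$$\Bigl(1+\frac{1}{x}\Bigr)^{x} = \Bigl(1-\frac{1}{y}\Bigr)^{-y} = \Bigl(1+\frac{1}{y-1}\Bigr)^{y},$$

hence 

$$\lim_{x\to-\infty}\Bigl(1+\frac{1}{x}\Bigr)^{x} = \lim_{y\to+\infty}\Bigl(1+\frac{1}{y-1}\Bigr)^{y} = e. \tag{2.31}$$

Changing variable $x = 1/y$, from (2.30) and (2.31) we finally infer 

$$\lim_{x\to 0}(1 + x)^{1/x} = e.$$

Since $\log x$ is continuous, as it is the inverse of a continuous function, 
(2.28) can be written as 

$$\lim_{x\to 0}\frac{\log(1+x)}{x} = 1$$

and, again changing variable, $y = e^x - 1$, we get 

$$\lim_{x\to 0}\frac{e^x - 1}{x} = 1,$$

which yields that $e^x$ is differentiable and $De^x = e^x$, and also $Da^x = 
a^x\log_e a$ (see, for example, 4.3 and 4.4 of [GM1]). 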



2.6 Summing Up 

Limit of sequences 

Definition 

$x_n \to L$ as $n \to \infty$ means: for every neighborhood $U$ of $L$ there is $\bar n$ such that $x_n \in U$ 
for all $n \ge \bar n$. 

Properties 

o UNIQUENESS. The limit is unique if it exists. 
o BOUNDEDNESS. If $x_n \to L \in \mathbb{R}$, then $\{x_n\}$ is bounded. 
o SQUEEZING TEST. If $a_n \le b_n \le c_n$ and $a_n \to L$ and $c_n \to L$, then also $b_n \to L$. 
o COMPARISON TEST. If $a_n \ge b_n$ $\forall n$ and $b_n \to +\infty$, then also $a_n \to +\infty$. 
o CONSTANCY OF SIGN. If $x_n \to L$ and $L > 0$, then $x_n$ is positive for $n$ large enough. If 
$x_n \ge 0$ and $x_n \to L$, then $L \ge 0$. 



Limit of functions and limit of sequences 

o Let $f : ]a, b[ \to \mathbb{R}$ and $x_0 \in [a, b]$. Then $f(x) \to L \in \overline{\mathbb{R}}$ as $x \to x_0$, $x \in ]a, b[$, if and only 
if $f(x_n) \to L$ for any sequence $\{x_n\} \subset ]a, b[ \setminus \{x_0\}$ with $x_n \to x_0$. 

o Let $f : [a, b] \to \mathbb{R}$. $f$ is continuous at $x_0 \in [a, b]$ if and only if $f(x_n) \to f(x_0)$ for 
every sequence $\{x_n\} \subset [a, b]$ with $x_n \to x_0$. 



Fundamental theorems 

$\{x_n\}$ is a Cauchy sequence if $\forall \epsilon > 0$ $\exists \bar n$ such that $|x_h - x_k| < \epsilon$ for all $h, k \ge \bar n$. 



o Monotone sequences. Every monotone sequence has limit in $\overline{\mathbb{R}}$. 
o Maximizing and minimizing sequences. For any nonempty $A \subset \mathbb{R}$ there exist a maximizing 
sequence, that is, a sequence $\{x_n\} \subset A$ with $x_n \to \sup A$, and a minimizing 
sequence, that is, a sequence $\{x_n\} \subset A$ with $x_n \to \inf A$. 
o Cauchy's criterion. $\{x_n\}$ is convergent if and only if $\{x_n\}$ is a Cauchy sequence. 

o Bolzano-Weierstrass theorem. Every bounded sequence contains a convergent 
subsequence. 



Upper and lower limits 



Definition 

$$\liminf_{n\to\infty} x_n := \lim_{n\to\infty}\,\inf_{k\ge n}\{x_k\}, \qquad \limsup_{n\to\infty} x_n := \lim_{n\to\infty}\,\sup_{k\ge n}\{x_k\}.$$



Properties 

o $\liminf_{n\to\infty} x_n$, $\limsup_{n\to\infty} x_n$ always exist in $\overline{\mathbb{R}}$, and $\liminf_{n\to\infty} x_n \le \limsup_{n\to\infty} x_n$, 

o $x_n \to L$ if and only if $\liminf_{n\to\infty} x_n = \limsup_{n\to\infty} x_n = L$, 
o Let $L \in \mathbb{R}$. $L = \limsup_{n\to\infty} x_n$ if and only if 

(i) $\forall \epsilon > 0$, $\exists \bar n$ such that $x_n < L + \epsilon$ for all $n \ge \bar n$, 

(ii) there is a subsequence $\{x_{k_n}\}$ of $\{x_n\}$ such that $x_{k_n} \to L$. 
o Let $L \in \mathbb{R}$. $L = \liminf_{n\to\infty} x_n$ if and only if 

(i) $\forall \epsilon > 0$, $\exists \bar n$ such that $x_n > L - \epsilon$ for all $n \ge \bar n$, 

(ii) there is a subsequence $\{x_{k_n}\}$ of $\{x_n\}$ such that $x_{k_n} \to L$. 

Figure 2.11. Some remarkable limits. 



2.7 Exercises 



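2.82 ¶. Let 

$$A := \{a_n \in \mathbb{R} \mid n \in \mathbb{N}\}, \qquad B := \{b_n \in \mathbb{R} \mid n \in \mathbb{N}\},$$
$$C := \{a_n + b_n \mid n \in \mathbb{N}\}, \qquad D := \{a_n b_n \mid n \in \mathbb{N}\}.$$

(i) Show that $\sup C \le \sup A + \sup B$. Give examples in which the inequality is strict. 

(ii) Assume $a_n, b_n \ge 0$ $\forall n$ and show that $\sup D \le \sup A\,\sup B$, observing that the 
inequality can be strict. 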

2.83 Find the infimum and the supremum of some of the following sets: 



— — I n G N, n > 1 1, 
|- - - lx, ye (0,1)), 
|sinx I X G (0, 7t/2)|, 



|l— l|n€N, n>l|, 

< — n, m G N, n, m > 0 >, 

I J 

I n G N, n > l), a > 0. 



2.84 f . Show that \/^, n, y/'n? + 2 are subsequences of y/n^ that is, declare the se- 
lection function. 



2.85 i[. Show that 



1 . /I 1 

hm ( - + -= 
n-»-oo \n \/n2 - 



2 4- 2n. / 
2.86 ¶. Compute, if they exist, the limits of some of the following sequences: 



n + cos n 



sin(n) log(n) sin(l/n), 

^3 + n + in?, 

— 1 / \/n^~+T, 






y/n{-\/n + 1 — y/n — 1), 

sin(l/n) 

1 — cos(l/n) ’ 

(cosn + 3)’^ 



ilog(n-f n^), 
n 



/ n + \/n — \ n — \/n, 



/-n? 6 



v^8n^ — n — n ’ 
/ 2\n 

/ l\n‘^ 

(iH ) 5 

V n/ 



2.87 % f . Compute, if they exist, the limits of the following sequences: 



rn+l 

/ [tt — 2 arctan dt, 

J n 

rn+l 2 

/ e-‘ dt, 

J n 

f {log 
Jo 



sinn^x 

I tan dx, 

Jo n 

rn 2 

y/n / e~'^^ logtdt, 

Jo 

n r xe-^^dx. 

Jo 



2.88 If. Let Xn := |1 — 10 Find the supremum and the infimum of the set 

A:= {xn\xl^ <1- {xn - 1 )^}. 

2.89 f f . Compute the upper and the lower limits of the following sequences: 



an := y|n-(-2)-|, 

n odd, 

^ neven, 

an := distance between n and the closest square. 



ttn ( 1 + 



a„ := { 2vAE+3 

l (_ l )»/2 



2.90 Show that every sequence contains a monotone subsequence. 

2.91 ¶¶. Show that $\{x_n\}$ has limit if and only if every subsequence of $\{x_n\}$ contains 
a further subsequence which has limit and all these limits are equal. 

2.92 ¶¶. Show that and in general are complete metric spaces. [Hint: Show 
that a sequence converges iff the sequences of its components converge.] 

2.93 ¶. Discuss the following claims. 

(i) If $\{a_n\}$ converges and $\{b_n\}$ does not converge, then $\{a_n b_n\}$ does not converge. 

(ii) If $\{a_n\}$ and $\{b_n\}$ are monotonically increasing, then $\{a_n b_n\}$ is increasing. 

2.94 ¶. Let $a_n > 0$ $\forall n$ and $a_{n+1}/a_n \to L$. Show that 

(i) $a_n \to 0$ if $0 \le L < 1$, 

(ii) $a_n \to +\infty$ if $L > 1$. 

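2.95 ¶¶ Cesàro's theorems. Show that 

(i) $\liminf_n a_n \le \liminf_n \dfrac{a_1 + \dots + a_n}{n} \le \limsup_n \dfrac{a_1 + \dots + a_n}{n} \le \limsup_n a_n$, 

(ii) $\liminf_n (a_n - a_{n-1}) \le \liminf_n \dfrac{a_n}{n} \le \limsup_n \dfrac{a_n}{n} \le \limsup_n (a_n - a_{n-1})$. 

In particular, in case $\{a_n - a_{n-1}\}$ has limit, then 

$$\lim_{n\to+\infty}\frac{a_n}{n} = \lim_{n\to\infty}(a_n - a_{n-1}).$$

(iii) If $a_n > 0$ $\forall n$, 

$$\liminf_{n\to\infty} a_n \le \liminf_{n\to\infty}\Bigl(\prod_{i=1}^{n} a_i\Bigr)^{1/n} \le \limsup_{n\to\infty}\Bigl(\prod_{i=1}^{n} a_i\Bigr)^{1/n} \le \limsup_{n\to\infty} a_n;$$

in particular, if $\{a_n\}$ has limit, then 

$$\lim_{n\to\infty}\Bigl(\prod_{i=1}^{n} a_i\Bigr)^{1/n} = \lim_{n\to\infty} a_n.$$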

2.96 1. Discuss the convergence of the following sequences 

11 1 logn! 

n + 1 n4-2 2n’ n’ 

sin ^7T y4n2-|-V^^ . 

2.97 ¶. Show that 

$$\lim_{n\to\infty}\sin nx \text{ exists iff } x = k\pi,\ k \in \mathbb{Z},$$

$$\lim_{n\to\infty}\cos nx \text{ exists iff } x = 2k\pi,\ k \in \mathbb{Z}.$$

[Hint: Using the double-angle formula show that, if $\sin nx \to L$, then either $L = 0$ or 
$L = \pm\sqrt{3}/2$; use then the addition formula for $\sin(n + 1)x$ to produce a contradiction if 
$x \ne k\pi$.] 

2.98 ¶ Pythagorean algorithm. Set $a_1 = b_1 = 1$, and for $n \ge 1$ 

$$a_{n+1} = a_n + b_n, \qquad b_{n+1} = 2a_n + b_n.$$

Show that $b_n/a_n \to \sqrt{2}$. [Hint: Show first that 
o $a_n, b_n \ge 2$ $\forall n \ge 2$, 
o $\{a_n\}$ and $\{b_n\}$ are both strictly increasing, 
o $a_n, b_n \to \infty$, 
o $b_n^2 - 2a_n^2 = (-1)^n$.] 



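2.99 ¶ Heron's algorithm. Let $a$ be a positive number. Define recursively the 
sequence $\{x_n\}$ by 

$$x_0 = a > 0, \qquad x_{n+1} = \frac{1}{2}\Bigl(x_n + \frac{a}{x_n}\Bigr). \tag{2.32}$$

Show that $x_n \to \sqrt{a}$. Show also that the absolute error $\delta_n := x_n - \sqrt{a}$ verifies $\delta_{n+1} < 
\dfrac{\delta_n^2}{2\sqrt{a}}$, that is 

$$\delta_{n+p} < 2\sqrt{a}\,\Bigl(\frac{\delta_n}{2\sqrt{a}}\Bigr)^{2^p}, \qquad \forall n > 0 \text{ and } p \ge 1.$$

[Hint: Observe that 
o $x_{n+1} - \sqrt{a} = \dfrac{(x_n - \sqrt{a})^2}{2x_n} \ge 0$, 

o $\{x_n\}$ decreases.] 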
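The quadratic decay of the error in Heron's algorithm is easy to observe. The sketch below is illustrative only: the function name and the free choice of the starting point `x0` are assumptions of ours, and the values $a = 2$, $x_0 = 2$ are sample data.

```python
import math


def heron(a, x0, steps):
    """Return [x_0, ..., x_steps] with x_{n+1} = (x_n + a / x_n) / 2."""
    xs = [x0]
    for _ in range(steps):
        xs.append((xs[-1] + a / xs[-1]) / 2)
    return xs


if __name__ == "__main__":
    root = math.sqrt(2.0)
    for n, x in enumerate(heron(2.0, 2.0, 6)):
        # The error is roughly squared at each step until rounding dominates.
        print(n, x, x - root)
```

The number of correct digits roughly doubles at every step, so a handful of iterations already reaches the accuracy of double precision arithmetic.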



2.100 11. Let c and xo be positive real numbers. Show that the sequence defined 
inductively by 



Xn-\-l 




) 



converges to 



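2.101 ¶¶ Logarithm-arcsine algorithm. Consider the sequence 

$$S_{n+1} := S_n\sqrt{\frac{2S_n}{S_n + S_{n-1}}}, \qquad n = 0, 1, \dots.$$

(i) Fix $x > 0$. Set 

$$S_{-1} := \frac{x^2 - x^{-2}}{4}, \qquad S_0 := \frac{x - x^{-1}}{2},$$

and prove that $S_n \to \log x$ as $n \to \infty$. 

(ii) For $x \in [-1, 1]$, set 

$$S_{-1} := x\sqrt{1 - x^2}, \qquad S_0 := x,$$

and prove that $S_n \to \arcsin x$ as $n \to \infty$. 

[Hint: (i) Show that $S_n = S(2^{-n})$ where $S(h) := \dfrac{x^h - x^{-h}}{2h}$, (ii) show that $S_n = 
R(2^{-n})$ where $R(h) := \dfrac{\sin(ah)}{h}$ and $a = \arcsin x$.] 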



2.102 ¶ Given $a, b > 0$ set 

$A(a, b) := \dfrac{a+b}{2}$, the arithmetic mean, 

$G(a, b) := \sqrt{ab}$, the geometric mean, 

$H(a, b) := \dfrac{2ab}{a+b}$, the harmonic mean. 

(i) Show that the sequences $\{x_n\}$ and $\{y_n\}$ defined recursively by 

$$x_0 = a, \quad y_0 = b, \qquad x_{n+1} = A(x_n, y_n), \quad y_{n+1} = G(x_n, y_n)$$

converge to the same limit, called the arithmetic-geometric mean of $a$ and $b$. 

(ii) Show that the sequences $\{x_n\}$ and $\{y_n\}$ defined recursively by 

$$x_0 = a, \quad y_0 = b, \qquad x_{n+1} = A(x_n, y_n), \quad y_{n+1} = H(x_n, y_n)$$

both converge to the geometric mean $G(a, b)$ of $a$ and $b$. 

2.103 ¶ Stirling's formula. Set $a_n := \log n! - \frac{1}{2}\log n$. Show that 

(i) $\int_{3/2}^{n} \log x\,dx \le a_n \le \int_1^n \log x\,dx$, 

(ii) $\delta_n := \int_1^n \log x\,dx - a_n$ is increasing and bounded above, thus has a limit 
$\delta \in \mathbb{R}$, 

(iii) deduce that the rough formula holds 

$$n! \sim e^{1-\delta}\,n^n e^{-n}\sqrt{n}.$$

[Hint: Compare with Example 2.67 and Section 7.4.] 

2.104 ¶¶ Singular perturbation. Let $f : \mathbb{R} \to \mathbb{R}$. Show that 

(i) for any $y \in \mathbb{R}$ and $n \in \mathbb{N}$ the function 

$$(f(x) - y)^2 + \frac{1}{n}x^2$$

has a minimum point $x_n$, 

(ii) while $\{x_n\}$ may be unbounded, the sequence $\{x_n/\sqrt{n}\}$ is bounded, 

(iii) the sequence of minimum values 

$$(f(x_n) - y)^2 + \frac{1}{n}x_n^2$$

has a limit as $n \to \infty$. 
Leonardo Pisano (1170-1250), called Fibonacci 
Johannes de Sacrobosco (1195-1256) 
Campanus of Novara (1220-1296) 
Jean Buridan (1295-1358) 
Albert of Saxony (1316-1390) 
Nicole d'Oresme (1323-1382) 
Johann Regiomontanus (1436-1476) 
Luca Pacioli (1445-1517) 
Scipione del Ferro (1465-1526) 
Nicolaus Copernicus (1473-1543) 
Rudolf Stiffel (1487-1561) 
Niccolò Fontana (1500-1557), called Tartaglia 
Girolamo Cardano (1501-1576) 
Lodovico Ferrari (1522-1565) 
Rafael Bombelli (1526-1573) 
Christopher Clavius (1537-1612) 
François Viète (1540-1603) 
Simon Stevin (1548-1620) 
John Napier (1550-1617) 
Bartholomeo Pitiscus (1561-1613) 
Henry Briggs (1561-1630) 
Galileo Galilei (1564-1642) 
Paul Guldin (1577-1643) 
Marin Mersenne (1588-1648) 
Girard Desargues (1591-1661) 
René Descartes (1596-1650) 
Bonaventura Cavalieri (1598-1647) 
Pierre de Fermat (1601-1665) 
Gilles de Roberval (1602-1675) 
Evangelista Torricelli (1608-1647) 
John Wallis (1616-1703) 
Vincenzo Viviani (1622-1703) 
Blaise Pascal (1623-1662) 
Pietro Mengoli (1626-1686) 
Christiaan Huygens (1629-1695) 
Isaac Barrow (1630-1677) 
Robert Hooke (1635-1703) 
James Gregory (1638-1675) 
Sir Isaac Newton (1643-1727) 
Gottfried von Leibniz (1646-1716) 
Colin MacLaurin (1698-1746) 
Karl Feuerbach (1800-1834) 

Figure 2.12. A chronological table from Fibonacci to Newton and Leibniz. 




3. Integer Numbers: 
Congruences, Counting 
and Infinity 



In this chapter we collect a few complements to the theory of integers. 
In Section 3.1, after discussing Euclid's algorithm and the fundamental 
theorem of arithmetic, we deal with Euler's function and some of its 
applications to public key cryptography. In Section 3.2 we introduce a few basic 
elements of combinatorics, that is, the calculus of arrangements of a finite 
number of objects. Finally, in Section 3.3, we illustrate the notion of 
cardinality (or number of elements) of a (not necessarily finite) set, introducing 
some of the concepts involved in Cantor's theory of infinity. 



3.1 Congruences 

3.1.1 Euclid’s algorithm 

Any subset of the integers which is bounded above has a maximum (see, 
for example, Proposition 1.23). An easy consequence of this is division in 
the context of integers. 

3.1 Proposition. Let $a, b \in \mathbb{Z}$, $b \ne 0$. Then the number $a$ uniquely 
decomposes as 

$$a = qb + r \tag{3.1}$$

with $q, r \in \mathbb{Z}$ and $0 \le r < |b|$. 

Proof. Suppose $b > 0$ and let $q$ be the largest integer not greater than $a/b$, 
which exists by (vi) of Proposition 1.23. From $q \le a/b < q + 1$ we get for 
$r := a - qb$ that $0 \le r < b$. 

If $b < 0$, choose $q$ as the smallest integer not smaller than $a/b$: from 
$q \ge a/b > q - 1$ we get for $r := a - qb$ that $0 \le r < -b$. 

It remains to prove that the decomposition is unique. If $a = q_1 b + r_1 = 
q_2 b + r_2$ with $0 \le r_1, r_2 < |b|$, then $(q_2 - q_1)b = r_1 - r_2$: that is, $r_1 - r_2$ is 
an integral multiple of $b$. Since $|r_1 - r_2| < |b|$, we conclude that $r_1 - r_2 = 0$ 
and in turn that $q_1 = q_2$. □ 
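The decomposition of Proposition 3.1 differs slightly from what Python's built-in `divmod` returns when the divisor is negative, since Python gives the remainder the sign of the divisor. The sketch below, whose function name is hypothetical, adjusts the built-in result so that $0 \le r < |b|$ always holds.

```python
def euclidean_divmod(a, b):
    """Return (q, r) with a == q*b + r and 0 <= r < |b|, as in Proposition 3.1."""
    if b == 0:
        raise ZeroDivisionError("the divisor b must be nonzero")
    q, r = divmod(a, b)   # built-in: r has the sign of b, so r <= 0 when b < 0
    if r < 0:             # shift the remainder into the interval [0, |b|)
        q, r = q + 1, r - b
    return q, r


if __name__ == "__main__":
    for a, b in [(7, 2), (-7, 2), (7, -2), (-7, -2)]:
        q, r = euclidean_divmod(a, b)
        print(f"{a} = {q} * {b} + {r}")
```

For positive divisors the function agrees with `divmod`; only the case $b < 0$ with a nonzero remainder requires the correction.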
In (3.1) $a$ is called the dividend, $b$ the divisor, $q$ the quotient and $r$ the 
remainder. We write 

$$q =: a \mathbin{//} b, \qquad r =: a \bmod b,$$

hence $a = (a \mathbin{//} b)\,b + a \bmod b$, $\forall a, b \in \mathbb{Z}$, $b \ne 0$. 

Let $a, b \in \mathbb{Z}$ and $b \ne 0$; we say that $b$ divides $a$ or that $b$ is a divisor 
of $a$, if $a$ is a multiple of $b$, that is, $a = bq$ for some $q \in \mathbb{Z}$. In this case we 
write $b \mid a$. Of course $b \mid a$ if and only if $a \bmod b = 0$, and, if $a$ is a multiple of 
$b$ and $b$ is a multiple of $c$, then $a$ is a multiple of $c$. Moreover, if $a$ and $b$ are 
multiples of $c$, then $ax + by$ is a multiple of $c$, too, $\forall x, y \in \mathbb{Z}$. In particular 
$a$ and $b$ are multiples of $c$ if and only if $b$ and $a \bmod b$ are multiples of $c$. 



a. The greatest common divisor 

Let $a, b \in \mathbb{Z}$ be nonzero integers. The set of common divisors of both $a$ and 
$b$ is nonempty and bounded. The largest of those numbers is called the 
greatest common factor or the greatest common divisor of $a$ and $b$, and is 
denoted by 

g.c.d.$(a, b)$. 

In other words, $r \in \mathbb{Z}$ is the greatest common divisor of $a$ and $b$ if and 
only if 

o $a$ and $b$ are multiples of $r$, 
o if $a$ and $b$ are multiples of $s$, then $s \le r$. 

Trivially g.c.d.$(a, b)$ = g.c.d.$(b, a)$ = g.c.d.$(|a|, |b|)$. Finally, we say that 
$a, b \in \mathbb{Z}$ are prime to one another or coprime if g.c.d.$(a, b) = 1$. A number 
$p$ is said to be prime if $p > 1$ and $p$ has no positive divisor except 1 and $p$. 



3.2 Show that 

(i) g.c.d.$(a, b) = b$ if and only if $b$ divides $a$. 

(ii) Let $p$ be prime. Then either g.c.d.$(a, p) = p$, that is $a$ is a multiple of $p$, or 
g.c.d.$(a, p) = 1$, that is $a$ and $p$ are coprime. 

We mentioned in Chapter 1 Euclid's algorithm as a method of finding 
a common submultiple of two magnitudes, discovering that in general it 
generates a process that never ends. When applied to two nonzero integers 
$a$ and $b$, Euclid's algorithm stops after a finite number of steps and 
produces the greatest common divisor of $a$ and $b$. 

3.3 Euclid's algorithm. Let $a, b$ be positive integers with $a \ge b$. The 
algorithm consists in dividing the larger of the two numbers by the smaller, 
then the smaller by the remainder of the first division, then the remainder 
of the first division by the remainder of the second, and so on. Formally, 
we set $r_0 := a$, $r_1 := b$ and 
    #!/usr/bin/env python3

    def euclid(a, b):
        # Replace (r1, r2) by (r2, r1 mod r2) until the remainder is zero.
        r1, r2 = a, b
        while r2 != 0:
            r1, r2 = r2, r1 % r2
        return r1  # the last nonzero remainder, i.e. g.c.d.(a, b)

    if __name__ == '__main__':
        print(euclid(168, 14))



Figure 3.1. Euclid's algorithm in Python. 



$$r_2 := r_0 \bmod r_1, \qquad r_3 := r_1 \bmod r_2, \qquad r_4 := r_2 \bmod r_3, \qquad \dots \tag{3.2}$$

Since $r_0 \ge r_1 > r_2 > r_3 > \cdots$ are nonnegative integers, the process will 
terminate with remainder zero after a finite number of steps, 

$$r_n := r_{n-2} \bmod r_{n-1}, \qquad r_{n+1} := r_{n-1} \bmod r_n = 0. \tag{3.3}$$

We have 

3.4 Theorem (Euclid). $r_n = $ g.c.d.$(a, b)$. 

Proof. Observe that if $a = bq + c$ and $c \ne 0$, then $r$ divides $a$ and $b$ if and 
only if $r$ divides $b$ and $c$. Consequently g.c.d.$(a, b)$ = g.c.d.$(b, c)$. Iterating 
this observation along the algorithm, we get 

$$\text{g.c.d.}(a, b) = \text{g.c.d.}(r_0, r_1) = \dots = \text{g.c.d.}(r_{n-1}, r_n) = r_n,$$

the last equality holding because $r_{n+1} = 0$, that is, $r_n$ divides $r_{n-1}$. 

□ 



3.5 Remark. Euclid's algorithm can be written as 

$$\begin{cases} r_0 = a, \quad r_1 = b,\\ q_k = r_{k-1} \mathbin{//} r_k,\\ r_{k-1} = q_k r_k + r_{k+1}, \end{cases}$$

for $k = 1, 2, \dots, n$ as long as $r_k > 0$, i.e., until $r_{n+1} = 0$. Notice that 
$q_n \ge 2$. 

Figure 3.2. The frontispiece of the Works by Diophantus of Alexandria (200-284) 
with comments of Claude Gaspard Bachet de Méziriac (1581-1638) and observations by 
Pierre de Fermat (1601-1665), Tolosae 1670, and the frontispiece of Algebra by Abu al 
Khwarizmi (790-850). 



3.6 Corollary. Let $a$, $b$ and $\lambda$ be integers with $a, b, \lambda > 0$. Then 
g.c.d.$(\lambda a, \lambda b) = \lambda\,$g.c.d.$(a, b)$. 

Proof. Let $\{r_n\}$, $\{s_n\}$ be the lists of remainders produced by Euclid's 
algorithm starting respectively with $a, b$ and $\lambda a, \lambda b$. Since $\lambda a \bmod (\lambda b) = 
\lambda\,(a \bmod b)$, we deduce that $s_n = \lambda r_n$ $\forall n$. Thus $r_{n+1} = 0$ if and only if 
$s_{n+1} = 0$, and 

$$\text{g.c.d.}(\lambda a, \lambda b) = s_n = \lambda r_n = \lambda\,\text{g.c.d.}(a, b).$$

□ 



3.7 Corollary. We have 

(i) if $c$ divides $a$ and $b$, then g.c.d.$(a/c, b/c)$ = g.c.d.$(a, b)/c$, 

(ii) $a/$g.c.d.$(a, b)$ and $b/$g.c.d.$(a, b)$ are coprime, 

(iii) if g.c.d.$(a, b) = 1$ and $a$ divides $bc$, then $a$ divides $c$. 



b. Integer solutions of first order equations 

We discuss now the solvability in $\mathbb{Z}$ of first order equations with integer 
coefficients, starting from the homogeneous case 

$$ax + by = 0. \tag{3.4}$$

3.8 Proposition. Let $a, b$ be nonzero integers. Then the equation (3.4) 
is solvable in $\mathbb{Z}$ and all solutions are given by the family of pairs $(x_k, y_k)$ 
given by 

$$(x_k, y_k) := \frac{k}{\text{g.c.d.}(a, b)}\,(b, -a), \qquad k \in \mathbb{Z}. \tag{3.5}$$

Proof. Trivially all pairs in (3.5) solve the equation. Conversely suppose 
that $(x, y)$ solves (3.4). Dividing by g.c.d.$(a, b)$ we have 

$$\frac{a}{\text{g.c.d.}(a, b)}\,x = -\frac{b}{\text{g.c.d.}(a, b)}\,y,$$

in particular $b/$g.c.d.$(a, b)$ divides the product on the left-hand side and, 
consequently, $x$, since $a/$g.c.d.$(a, b)$ and $b/$g.c.d.$(a, b)$ are coprime (see, for 
example, Corollary 3.7). For some $k \in \mathbb{Z}$ we then have $x = k\,\dfrac{b}{\text{g.c.d.}(a, b)}$, and 
substituting into (3.4), we conclude. □ 

Consider now the nonhomogeneous linear equation 

$$ax + by = c, \qquad c \ne 0.$$

3.9 Theorem (Bézout's theorem). Let $a, b, c$ be nonzero integers. The 
nonhomogeneous equation $ax + by = c$ is solvable if and only if g.c.d.$(a, b)$ 
divides $c$. 

Proof. Suppose $x, y \in \mathbb{Z}$ solve $ax + by = c$. Trivially g.c.d.$(a, b)$ has to 
divide $c$. Conversely, suppose that g.c.d.$(a, b)$ divides $c$. We shall just prove 
that there exist integers $x$ and $y$ such that $ax + by = $ g.c.d.$(a, b)$. 

Consider the following recurrence scheme, known as the generalized 
Euclid algorithm, 

$$\begin{cases} r_0 = a, \quad r_1 = b,\\ r_{k+1} = -q_k r_k + r_{k-1}, \quad q_k := r_{k-1} \mathbin{//} r_k, \end{cases} \qquad \begin{cases} x_0 = 1, \quad x_1 = 0,\\ x_{k+1} = -q_k x_k + x_{k-1}, \end{cases} \qquad \begin{cases} y_0 = 0, \quad y_1 = 1,\\ y_{k+1} = -q_k y_k + y_{k-1}, \end{cases}$$

for $k = 1, 2, \dots, n$ as long as $r_k > 0$. Of course the $r_k$'s are the remainders 
of Euclid's algorithm, and it is easily seen by induction that $ax_k + by_k = r_k$ 
$\forall k = 0, 1, \dots, n$. Then Euclid's theorem yields 

$$\text{g.c.d.}(a, b) = r_n = ax_n + by_n$$

as required. Going back to the equation $ax + by = c$, a solution is then 
given by 
    #!/usr/bin/env python3

    def euclid2(a, b):
        # assume a, b > 0, a >= b.
        # Invariant: a*x1 + b*y1 == r1 and a*x2 + b*y2 == r2.
        r1, r2 = a, b
        x1, x2 = 1, 0
        y1, y2 = 0, 1
        while r2 != 0:
            q, r = divmod(r1, r2)
            x1, x2 = x2, x1 - q*x2
            y1, y2 = y2, y1 - q*y2
            r1, r2 = r2, r
        return r1, x1, y1

    if __name__ == '__main__':
        a, b = 1224, 12*17
        c, x, y = euclid2(a, b)
        print('gcd =', c)
        print(x, a, y, b)
        print(a*x + b*y)

Figure 3.3. A Python implementation of the generalized Euclid's algorithm that 
computes a solution of $ax + by = $ g.c.d.$(a, b)$. 



$$(x, y) := \frac{c}{\text{g.c.d.}(a, b)}\,(x_n, y_n).$$

□ 
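Combining Bézout's theorem with the description (3.5) of the homogeneous solutions gives every integer solution of $ax + by = c$. The following sketch is an illustration under stated assumptions: the function names and the parameter `k`, which selects a member of the family (3.5), are ours, and $a, b$ are assumed positive.

```python
def extended_gcd(a, b):
    """Return (g, x, y) with a*x + b*y == g == g.c.d.(a, b), for positive a, b."""
    r0, r1 = a, b
    x0, x1 = 1, 0
    y0, y1 = 0, 1
    while r1 != 0:
        q = r0 // r1
        r0, r1 = r1, r0 - q * r1
        x0, x1 = x1, x0 - q * x1
        y0, y1 = y1, y0 - q * y1
    return r0, x0, y0


def solve_linear(a, b, c, k=0):
    """Return an integer solution (x, y) of a*x + b*y == c, or None if none exists.

    A particular solution is shifted by k times the homogeneous solution
    (b/g, -a/g) from (3.5), where g = g.c.d.(a, b).
    """
    g, x, y = extended_gcd(a, b)
    if c % g != 0:          # Bezout: solvable only if g divides c
        return None
    scale = c // g
    return scale * x + k * (b // g), scale * y - k * (a // g)


if __name__ == "__main__":
    print(solve_linear(240, 46, 10))
    print(solve_linear(240, 46, 10, k=3))
    print(solve_linear(240, 46, 7))   # 2 does not divide 7: no solution
```

Varying `k` over the integers enumerates all solutions, because two solutions of the nonhomogeneous equation differ by a solution of (3.4).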



3.10 Remark. We also remark that Euclid's algorithm is quite efficient. 
Suppose that we perform two successive divisions 

$$r_2 = r_0 \bmod r_1, \qquad r_3 = r_1 \bmod r_2, \tag{3.6}$$

then $r_3 < r_1/2$. In fact, either $r_2 \le r_1/2$ and $r_3 < r_2 \le r_1/2$, or $r_2 > r_1/2$, 
that is $2r_2 > r_1$, hence $q_2 = 1$ and again $r_3 = r_1 - r_2 < r_1/2$. 

Euclid's algorithm in (3.2) requires at most $n + 2$ divisions in order 
to stop at a zero remainder, thus, if we denote by $p$ the integral part of 
$(n + 1)/2$, we have in the worst case 

$$1 \le r_n < r_{n-2}/2 < \dots < b/2^p,$$

i.e., $p < \log_2 b$. Therefore $(n + 1)/2 \le p + 1/2 < \log_2 b + 1/2$, that is, 
$n < 2\log_2 b$. We then conclude that Euclid's algorithm stops after at most 
2 times the number of digits in the binary representation of $b$. For an 
optimal estimate, see Corollary 8.9. 
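The logarithmic bound on the number of divisions can be checked experimentally. In the sketch below the function name is hypothetical, and the comparison uses the slightly weaker threshold $2\log_2 b + 2$; this margin is an assumption of ours, chosen to stay safe for very small $b$, and is not the statement of the remark. Consecutive Fibonacci numbers are used as sample input because they are the classical slow case for Euclid's algorithm.

```python
import math


def euclid_steps(a, b):
    """Return (g.c.d.(a, b), number of divisions) for positive integers a >= b."""
    steps = 0
    while b != 0:
        a, b = b, a % b
        steps += 1
    return a, steps


if __name__ == "__main__":
    # Consecutive Fibonacci numbers as sample input.
    fib = [1, 1]
    while len(fib) < 30:
        fib.append(fib[-1] + fib[-2])
    for k in (5, 10, 20, 29):
        g, steps = euclid_steps(fib[k], fib[k - 1])
        print(fib[k], fib[k - 1], g, steps, 2 * math.log2(fib[k - 1]))
```

Even in this slow case the number of divisions stays below twice the number of binary digits of $b$.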

3.11 Show that g.c.d.$(a, b)$ is the minimum of the set 

$$A := \{n \in \mathbb{N} \mid n > 0 \text{ and } n = ax + by \text{ for some } x, y \in \mathbb{Z}\}.$$

Figure 3.4. Pierre de Fermat (1601-1665) and Leonhard Euler (1707-1783). 



3.1.2 Prime factorization 

3.12 Theorem (The fundamental theorem of arithmetic). Every
integer $n \ge 2$ decomposes as a product of primes, and, apart from
rearrangement of factors, such a decomposition is unique.

Proof. We proceed by induction on $n$. If $n$ is prime there is nothing to
prove; otherwise, if $p$ is the smallest of its divisors greater than 1, $p$ has to be
prime and $n = mp$. Since $m < n$, by induction $m$ and in turn $n$ are a product of
primes.

To prove uniqueness of the factorization, recall that by Corollary 3.7 if
$p$ is prime and $p$ divides $nm$, then $p$ divides $n$ or $m$. Suppose
$p_1 p_2 \cdots p_r = q_1 q_2 \cdots q_s$ where $p_1, \dots, p_r, q_1, \dots, q_s$ are prime. For each
$j = 1, \dots, r$, $p_j$ divides the product $q_1 q_2 \cdots q_s$, while $p_j$ and any factor $q_i$
different from $p_j$ are coprime, thus necessarily $p_j$ is one of the $q_i$. It follows
that $r \le s$, and, changing the $p$'s with the $q$'s, we get $r = s$, and, apart
from rearrangement, $p_i = q_i$ for all $i = 1, \dots, r$. □

If we arrange the factors of the decomposition of $n \in \mathbb{N}$ in increasing
order, we obtain that every integer $n \ge 2$ decomposes uniquely as

$$n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k}$$

where $2 \le p_1 < p_2 < \cdots < p_k$ are prime, and $\alpha_1, \dots, \alpha_k \ge 1$.



3.13 Theorem (Second Euclid's theorem). The number of primes is
infinite.

Proof. Suppose $p$ prime. Let $2, 3, 5, \dots, p$ be the set of primes up to $p$, and
let

$$n := 2 \cdot 3 \cdots (p - 1)\,p + 1.$$

Then $n$ is not divisible by any prime between 2 and $p$. On the other hand
$n$ has a decomposition in primes; therefore either $n$ is prime or divisible
by a prime between $p$ and $n$. In either case there is a prime greater than
$p$. □

3.14 ¶. Show that g.c.d. (a, b) is the product of factors common to $a$ and $b$, i.e., is the
greatest common factor of $a$ and $b$.

How do we identify the list of primes?

The simplest method consists in relying upon the definition of prime
numbers. If $n = ab$, then $a$ and $b$ cannot both exceed $\sqrt{n}$; therefore any number $n$
that is not prime is divisible by a prime $p$ which does not exceed $\sqrt{n}$. In
order to decide whether $n$ is prime, it suffices then to divide $n$ by all primes
not greater than $\sqrt{n}$. Notice that the method allows us to factorize $n$ if $n$ is not
prime. In principle all is fine, but the method is impracticable even for
numbers that are not very large: to conclude we need to do roughly $\sqrt{n}$
divisions. If $10^{-6}$ seconds is the time needed to carry out a division, the
time necessary to factorize a number of 100 binary digits is around $2^{50} \cdot 10^{-6} \simeq 2^{30}$
seconds, i.e., about 32 years.
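The method of successive divisions can be sketched as follows. This is a minimal illustration, not an efficient factorization routine; the name `trial_division` is ours. For simplicity it tries every integer $p$ with $p^2 \le n$ rather than only primes: a composite $p$ never divides the remaining cofactor, because its prime factors have already been divided out.

```python
def trial_division(n):
    """Return the prime factorization of n >= 2 as a list of (prime, exponent)."""
    factors = []
    p = 2
    while p * p <= n:
        if n % p == 0:
            exponent = 0
            while n % p == 0:
                n //= p
                exponent += 1
            factors.append((p, exponent))
        p += 1
    if n > 1:
        # whatever is left has no divisor up to its square root, so it is prime
        factors.append((n, 1))
    return factors
```

A number $n$ is prime exactly when `trial_division(n)` returns `[(n, 1)]`.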

A variant of the previous method is a procedure known as the sieve of
Eratosthenes that allows us to find all primes not greater than $n$ once we
know all primes not greater than $\sqrt{n}$. It works as follows. Suppose we have a
list $p_1, p_2, \dots, p_{s(n)}$ of all primes not greater than $\sqrt{n}$. From $p_{s(n)} + 1, p_{s(n)} + 2, \dots, n$
we strike out successively all multiples of $p_1, p_2, \dots, p_{s(n)}$ (up to $n$). The
remaining numbers are all primes. In fact, if $q$ is one of these numbers, it is
not divisible by any of the primes not greater than $\sqrt{n}$, and consequently by
none of the primes not greater than $\sqrt{q}$. Also the sieve of Eratosthenes, though requiring multiplications instead
of divisions, requires a number of multiplications of order $\sqrt{n}$. One says
that the computational complexity of the sieve of Eratosthenes is $O(2^{N/2})$,
$N := \log_2 n$ being the number of digits of $n$.
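A common way to write the sieve is sketched below. It is a variant of the procedure above: instead of assuming the primes up to $\sqrt{n}$ as known, it discovers them while sieving, starting from 2. The name `sieve` is ours.

```python
def sieve(n):
    """Return the list of all primes not greater than n."""
    if n < 2:
        return []
    is_prime = [True] * (n + 1)
    is_prime[0] = is_prime[1] = False
    p = 2
    while p * p <= n:
        if is_prime[p]:
            # strike out the multiples of p; those below p*p were already struck
            for multiple in range(p * p, n + 1, p):
                is_prime[multiple] = False
        p += 1
    return [i for i, flag in enumerate(is_prime) if flag]
```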

In the eighteenth century eventually the idea of looking for a complete
description of primes was given up, and research moved toward a
kind of statistical approach. The fundamental result in this direction is
the remarkable prime number theorem, first conjectured by Adrien-Marie
Legendre (1752-1833) and then proved by Jacques Hadamard (1865-1963)
and Charles de la Vallée-Poussin (1866-1962).



3.15 Theorem (The prime number theorem). Let $\pi(x)$ denote the
number of primes not greater than $x$. Then

$$\lim_{x \to \infty} \frac{\pi(x)}{x/\log x} = 1.$$
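The statement can be explored numerically. The sketch below computes $\pi(x)$ with a simple sieve and evaluates the ratio $\pi(x)/(x/\log x)$; the names `prime_count` and `pnt_ratio` are ours, and `math.isqrt` assumes Python 3.8 or later. The ratio approaches 1 only slowly.

```python
import math


def prime_count(x):
    """pi(x): the number of primes not greater than the integer x."""
    if x < 2:
        return 0
    flags = bytearray([1]) * (x + 1)
    flags[0] = flags[1] = 0
    for p in range(2, math.isqrt(x) + 1):
        if flags[p]:
            # clear every multiple of p starting from p*p
            flags[p * p::p] = bytearray(len(range(p * p, x + 1, p)))
    return sum(flags)


def pnt_ratio(x):
    """The quotient pi(x) / (x / log x) of the prime number theorem."""
    return prime_count(x) / (x / math.log(x))
```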



There seems to be evidence that $x/\log x$ is a good approximation of
$\pi(x)$: a celebrated conjecture, Riemann's conjecture, states that

for some $\epsilon > 0$, but it has not been proved or disproved up to now. Incidentally,
observe the unexpected relation between prime numbers and
Euler's number $e$ stated by the prime number theorem.

3.1.3 Linear congruences

Let $p \in \mathbb{N}$. We say that $a$ is congruent to $b$ modulo $p$ and write

$$a \equiv b \pmod p$$

if $a \bmod p = b \bmod p$. Equivalently $a \equiv b \pmod p$ if and only if $a - b = hp$
for some $h \in \mathbb{Z}$. It is easily seen that

(i) if $a \equiv b \pmod p$ and $a' \equiv b' \pmod p$, then $a + a' \equiv b + b' \pmod p$
and $aa' \equiv bb' \pmod p$

and that congruence (mod $p$) is an equivalence relation, i.e., it is

(i) Reflexive. $a \equiv a \pmod p$,

(ii) Symmetric. If $a \equiv b \pmod p$, then $b \equiv a \pmod p$,

(iii) Transitive. If $a \equiv b \pmod p$ and $b \equiv c \pmod p$, then $a \equiv c \pmod p$.

Equivalence classes of congruent numbers form a partition of all the
integers into classes of numbers with the same remainder of the division
by $p$. These classes can therefore be represented by the remainders of the
division by $p$. Formally, one defines the set of remainder classes modulo $p$
as

$$\mathbb{Z}_p := \{0, 1, \dots, p - 1\}$$

and the map $x \mapsto x \bmod p$, whose image is $\mathbb{Z}_p$, yields a way to introduce
the sum and the product modulo $p$ in $\mathbb{Z}_p$ by

$$a +_p b := (a + b) \bmod p, \qquad a \cdot_p b := (ab) \bmod p.$$

Congruences turn out to be quite important in everyday life. Here we
confine ourselves to basic facts. From the results on the solvability in $\mathbb{Z}$ of
linear equations, we readily infer



3.16 Proposition. Consider the equation in $x \in \mathbb{Z}$,

$$ax \equiv c \pmod p. \tag{3.7}$$

(i) If $c = 0$, then all solutions of (3.7) are given by the family of integers
$\{x_k\}$ given by

$$x_k = \frac{p}{\mathrm{g.c.d.}\,(a, p)}\, k, \qquad k \in \mathbb{Z}.$$

(ii) If $c \ne 0$, (3.7) has a solution if and only if g.c.d. $(a, p)$ divides $c$, and
all the solutions are given by the sequence of integers $\{x_k\}$

$$x_k = \frac{p}{\mathrm{g.c.d.}\,(a, p)}\, k + \frac{c}{\mathrm{g.c.d.}\,(a, p)}\, \bar{x}, \qquad k \in \mathbb{Z},$$

where $\bar{x}$ is such that $a\bar{x} + py = \mathrm{g.c.d.}\,(a, p)$ for some $y \in \mathbb{Z}$, and can be
computed by the generalized Euclid's algorithm.
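The proposition translates into a short solver, sketched here under some assumptions: the name `solve_congruence` is ours, and the modular inverse is obtained with Python's built-in `pow(x, -1, m)` (Python 3.8 or later) in place of the generalized Euclid's algorithm of the text.

```python
from math import gcd


def solve_congruence(a, c, p):
    """Solve a*x ≡ c (mod p).

    Return (x0, step) such that the solutions are exactly x0 + step*k,
    k integer, or None when g.c.d.(a, p) does not divide c.
    """
    g = gcd(a, p)
    if c % g != 0:
        return None
    step = p // g
    # inverse of a/g modulo p/g; it exists because the two are coprime
    inverse = pow(a // g, -1, step)
    x0 = (c // g) * inverse % step
    return x0, step
```

For example, `solve_congruence(6, 4, 10)` yields `(4, 5)`: the solutions of $6x \equiv 4 \pmod{10}$ are $4 + 5k$.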
In doing computations with congruences modulo a composite number,
the following theorem is quite useful.

3.17 Theorem (Chinese remainder theorem). Let $p_1, p_2, \dots, p_k$ be
pairwise coprime.

(i) $x \in \mathbb{Z}$ is a solution of the system

$$\begin{cases} x \equiv 0 \pmod{p_1}, \\ \quad\vdots \\ x \equiv 0 \pmod{p_k}, \end{cases}$$

if and only if $x \equiv 0 \pmod{p_1 p_2 \cdots p_k}$.

(ii) For any $(b_1, b_2, \dots, b_k) \in \mathbb{Z}^k$ the system

$$\begin{cases} x \equiv b_1 \pmod{p_1}, \\ x \equiv b_2 \pmod{p_2}, \\ \quad\vdots \\ x \equiv b_k \pmod{p_k}, \end{cases} \tag{3.8}$$

is solvable in $x \in \mathbb{Z}$.

More precisely, for any $i = 1, \dots, k$, denote by $M_i$ the product of all
the $p_j$ but $p_i$, and let $a_i$ be such that $M_i a_i \equiv 1 \pmod{p_i}$. Then
$x := \sum_{i=1}^{k} M_i b_i a_i$ is a solution of (3.8), and two solutions of (3.8)
differ by a multiple of $p_1 p_2 \cdots p_k$.

Proof. (i) It suffices to observe that, if $p$ and $q$ are coprime, then $a$ is a
multiple of both $p$ and $q$ if and only if $a$ is a multiple of $pq$, and proceed
inductively.

(ii) For any $i = 1, \dots, k$, observe that $M_j \equiv 0 \pmod{p_i}$ for $j \ne i$, and,
since $M_i$ and $p_i$ are coprime, there exists $a_i \in \mathbb{Z}$ such that

$$M_i a_i \equiv 1 \pmod{p_i}.$$

Consequently

$$x = \sum_{j=1}^{k} a_j b_j M_j \equiv a_i b_i M_i \equiv b_i \pmod{p_i}.$$

Finally, (i) implies that the difference of two solutions of (3.8) is a multiple
of $p_1 p_2 \cdots p_k$. □
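The construction used in the proof can be sketched directly. The function name `crt` is ours; the inverses $a_i$ are obtained with the built-in `pow(M_i, -1, p_i)`, and `math.prod` is used for the product of the moduli (both assume Python 3.8 or later).

```python
from math import prod


def crt(residues, moduli):
    """Return x in [0, p_1*...*p_k) with x ≡ residues[i] (mod moduli[i]).

    The moduli are assumed to be pairwise coprime.
    """
    n = prod(moduli)
    x = 0
    for b_i, p_i in zip(residues, moduli):
        m_i = n // p_i           # M_i: product of all moduli but p_i
        a_i = pow(m_i, -1, p_i)  # a_i with M_i * a_i ≡ 1 (mod p_i)
        x += m_i * b_i * a_i
    return x % n
```

For the classical system $x \equiv 2 \pmod 3$, $x \equiv 3 \pmod 5$, $x \equiv 2 \pmod 7$ the sum is $140 + 63 + 30 = 233$, which reduces to 23 modulo 105.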

Of particular relevance in the theory of congruences is the multiplicative
subgroup $\mathbb{Z}_p^*$ of the nonzero elements of $\mathbb{Z}_p$, and the discrete exponential
map $x \mapsto a^x \bmod p$ from $\mathbb{Z}_p$ into $\mathbb{Z}_p^*$. As we can see in the table in
Figure 3.5, we get $a^6 = 1$ for all $a \ne 0$ in $\mathbb{Z}_7$. This is a general fact first
observed by Pierre de Fermat (1601-1665).

     a | a^1  a^2  a^3  a^4  a^5  a^6
    ---+------------------------------
     1 |  1    1    1    1    1    1
     2 |  2    4    1    2    4    1
     3 |  3    2    6    4    5    1
     4 |  4    2    1    4    2    1
     5 |  5    4    6    2    3    1
     6 |  6    1    6    1    6    1

Figure 3.5. The table of $x \mapsto a^x$ for $\mathbb{Z}_7$.
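A table of this kind is easy to regenerate for any modulus. The helper below is a small sketch; the name `power_table` is ours.

```python
def power_table(p):
    """Rows [a, a^1 mod p, ..., a^(p-1) mod p] for a = 1, ..., p - 1."""
    return [[a] + [pow(a, k, p) for k in range(1, p)] for a in range(1, p)]
```

With `p = 7` the rows coincide with the table for $\mathbb{Z}_7$, and for a prime $p$ the last column consists of ones only.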



3.18 Theorem (Fermat minor theorem). If $p$ is prime, then $a^{p-1} \equiv 1$
(mod $p$) $\forall a \ne 0$.

The original proofs of many of the claims of Fermat were never found. The
following proof is due to Euler (1739).

Proof. By the binomial theorem (see, for example, Example 2.6),

$$(1 + a)^p = a^p + \sum_{k=1}^{p-1} \binom{p}{k} a^k + 1$$

where $\binom{p}{k} := \frac{p!}{k!\,(p - k)!}$. Since $p$ is prime and $k$ is less than $p$, all
binomial coefficients $\binom{p}{k}$, $k = 1, \dots, p - 1$, are multiples of $p$. Therefore

$$(1 + a)^p \equiv a^p + 1 \pmod p \qquad \forall a \in \mathbb{Z}.$$

Using the previous equality, it is not difficult to show by induction on $a$
that $a^p \equiv a \pmod p$. In fact, trivially $1^p \equiv 1 \pmod p$, and, if $b^p \equiv b$
(mod $p$), then

$$(b + 1)^p \equiv b^p + 1 \equiv b + 1 \pmod p.$$

Finally, if $a \ne 0$, g.c.d. $(a, p) = 1$, consequently, choosing $x \in \mathbb{Z}$ such that
$ax \equiv 1 \pmod p$, we conclude

$$a^{p-1} \equiv a^p x \equiv ax \equiv 1 \pmod p.$$

□

A different proof was given then by James Ivory (1765-1842) in 1808 
and presented again by Lejeune Dirichlet (1805-1859) in 1828. 

A different proof of Theorem 3.18. If $a \ne 0$ is a multiple of $p$, the theorem is trivial. If
$a$ is not divisible by $p$, the numbers

$$a,\ 2a,\ 3a,\ \dots,\ (p - 1)a$$

are pairwise not congruent modulo $p$. After rearrangement they are then congruent to
$1, 2, \dots, p - 1$. Thus

$$a \cdot 2a \cdot 3a \cdots (p - 1)a \equiv 1 \cdot 2 \cdot 3 \cdots (p - 1) \pmod p,$$

that is,

$$a^{p-1} \cdot 1 \cdot 2 \cdot 3 \cdots (p - 1) \equiv 1 \cdot 2 \cdot 3 \cdots (p - 1) \pmod p.$$

Since $2 \cdot 3 \cdots (p - 1)$ is not divisible by $p$, (v) yields

$$a^{p-1} \equiv 1 \pmod p$$

and in conclusion $a^p \equiv a \pmod p$. □

    2^2  - 1 =    3 =  3 · 1
    2^4  - 1 =   15 =  5 · 3
    2^6  - 1 =   63 =  7 · 9
    2^8  - 1 =  255 = 51 · 5
    2^10 - 1 = 1023 = 11 · 93
    2^12 - 1 = 4095 = 13 · 315

Figure 3.6. The values of $2^{n-1} - 1$ for $n = 3, 5, 7, 9, 11$ and 13.



3.19 Pseudo-primes. One says that $p$ is a pseudo-prime if $a^p \equiv a$
(mod $p$) for all $a \in \mathbb{Z}$. Of course primes are pseudo-primes, but there exist
pseudo-primes that are not prime: they are called Carmichael's numbers,
and the smallest is 561. Carmichael's numbers are quite rare, only 255 are
not greater than $10^8$. Therefore it is likely that a number that is chosen
randomly and verifies Fermat's test is prime with probability close to 1.
This means that the test $a^{p-1} \equiv 1 \pmod p$ for all $a \in \mathbb{Z}$ coprime to $p$ is a good indication
that $p$ is prime.

Fermat's test with $a = 2$ was actually the genesis of Fermat's theorem.
Consider the table in Figure 3.6. It shows that $2^{n-1} - 1$ is divisible by
$n$ if $n$ is 3, 5, 7, 11, 13. After many calculations Gottfried von Leibniz
(1646-1716) conjectured that $2^{n-1} - 1$ is divisible by $n$ if and only if $n$ is
prime. Fermat's theorem proves that $2^{n-1} - 1$ is divisible by $n$ if $n$ is an odd prime,
but F. Edouard Lucas (1842-1891) in 1819 showed that the converse is
not true. He showed that

$$\begin{cases} 2^{340} \equiv 1 \pmod{11}, \\ 2^{340} \equiv 1 \pmod{31}, \end{cases}$$

being that $2^5 \equiv -1 \pmod{11}$ and $2^5 \equiv 1 \pmod{31}$. Consequently $2^{340} \equiv 1$
(mod 341), i.e., $2^{340} - 1$ is divisible by $341 = 11 \cdot 31$, which is not prime. In
recent years, the probability has been computed that a number $n$ satisfies
Fermat's test with $a = 2$ but is not prime. It turned out that this probability
is quite small: for a randomly chosen number of, say, 200 bits, it is
of the order 2.6 • 10“®. 
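Fermat's test is cheap to run because it only needs a modular power. The sketch below (function names are ours) applies the test and uses a naive primality check to single out composite numbers that pass it.

```python
from math import gcd


def fermat_test(n, a=2):
    """True if a^(n-1) ≡ 1 (mod n), i.e. n passes Fermat's test in base a."""
    return pow(a, n - 1, n) == 1


def is_prime(n):
    """Naive primality check by successive divisions."""
    if n < 2:
        return False
    p = 2
    while p * p <= n:
        if n % p == 0:
            return False
        p += 1
    return True


def base2_pseudoprimes(limit):
    """Odd composite n < limit that nevertheless pass the test with a = 2."""
    return [n for n in range(3, limit, 2)
            if fermat_test(n) and not is_prime(n)]
```

Below 600 the only such numbers are 341 and 561. Changing the base exposes 341 (it fails the test with $a = 3$), whereas 561 passes for every base coprime to it.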



3.1.4 Euler's function $\phi$

Given an integer $n \ge 2$, we denote by $\phi(n)$ the number of positive integers not
greater than and prime to $n$ (and therefore smaller than $n$), and we set
$\phi(0) = \phi(1) = 1$. The function $\phi : \mathbb{N} \to \mathbb{N}$ defined this way is called Euler's
function $\phi$. Clearly,

$$\phi(p) = p - 1 \quad \text{if $p$ is prime.}$$

Figure 3.7. The map x mod 15.

Suppose that $p$ and $q$ are coprime. Since $p$ and $q$ have no common divisors,
we get that Euler's function is multiplicative, i.e.,

$$\phi(pq) = \phi(p)\,\phi(q) \quad \text{if $p$, $q$ are coprime.} \tag{3.9}$$

In particular

$$\phi(pq) = (p - 1)(q - 1) \quad \text{if $p$, $q$ are two distinct primes.}$$

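Now we shall compute $\phi(p^k)$, $p$ prime. Noticing that an integer $x \ge 2$ has a
common divisor greater than 1 with $p^k$ if and only if $x$ is a multiple of $p$, we compute

$$\#\{\, 2 \le x < p^k \mid \gcd(x, p^k) \ne 1 \,\} = \#\{\, h \mid 2 \le hp < p^k \,\} = \#\{\, h \mid 1 \le h < p^{k-1} \,\} = p^{k-1} - 1,$$

and consequently

$$\phi(p^k) = p^k - 1 - (p^{k-1} - 1) = p^k\Big(1 - \frac{1}{p}\Big). \tag{3.10}$$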



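The unique factorization of any number in primes, (3.9) and (3.10) finally
yield

3.20 Theorem (Euler). If $n = p_1^{\alpha_1} p_2^{\alpha_2} \cdots p_k^{\alpha_k}$ is the decomposition in
primes of $n$, $n \ge 2$, then

$$\phi(n) = n \Big(1 - \frac{1}{p_1}\Big)\Big(1 - \frac{1}{p_2}\Big) \cdots \Big(1 - \frac{1}{p_k}\Big).$$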
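Euler's function can be computed either by counting or through the product over the prime divisors of $n$. The sketch below compares the two; the function names are ours, and the prime divisors are found by successive divisions.

```python
from math import gcd


def phi_by_count(n):
    """phi(n) for n >= 2: how many x in {1, ..., n-1} are coprime to n."""
    return sum(1 for x in range(1, n) if gcd(x, n) == 1)


def phi_by_formula(n):
    """phi(n) for n >= 2 via n * prod(1 - 1/p) over the primes p dividing n."""
    result, m, p = n, n, 2
    while p * p <= m:
        if m % p == 0:
            while m % p == 0:
                m //= p
            result -= result // p  # multiply result by (1 - 1/p)
        p += 1
    if m > 1:
        result -= result // m      # one prime factor larger than sqrt is left
    return result
```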

We also prove 

3.21 Theorem (Euler). If $a$ and $n$ are coprime, then

$$a^{\phi(n)} \equiv 1 \pmod n.$$

Proof. Observe that, $a$ and $n$ being coprime, $x$ is coprime with $n$ if and only if $ax$ is
coprime with $n$. Moreover $ax$ and $ay$ are congruent modulo $n$ if and only if $x$ and $y$ are
congruent modulo $n$. In other words, the map $x \mapsto ax \bmod n$ is a bijection of the set

$$E := \{\, x \in \{1, \dots, n - 1\} \mid x \text{ and } n \text{ are coprime} \,\}$$

onto itself. In particular

$$\prod_{x \in E} x \equiv \prod_{x \in E} ax = a^{\phi(n)} \prod_{x \in E} x \pmod n,$$

that is, since $\prod_{x \in E} x$ is coprime with $n$,

$$a^{\phi(n)} \equiv 1 \pmod n.$$

□



3.1.5 RSA Cryptography 

Cryptography deals with the confidential transmission of data. Basically
the sender codes the message $M$ to a new object $C = f(M)$ by an injective
map $f$ defined on all possible messages, so that the addressee can decode
the message by the inverse map, $M = f^{-1}(C)$. The function $f$ is called the
cryptographic function. For several reasons, for instance because any cryptographic
function becomes less secure by use, it is important to change it
from time to time and consider instead a cryptographic system, that is a
family $\{f_k\}_{k \in K}$ of cryptographic functions indexed by a parameter called
a key. Moreover, it is convenient to consider the cryptographic system as
public and to ground the confidentiality of the system on the key.

One of the oldest ways of communicating secretly reduces to choosing 
openly a cryptographic system and then exchanging secretly a common key 
to code and decode messages. We then get a bilateral communication, that 
is the subjects are both senders and addressees. Among the recent algo- 
rithms for confidential connections between computers based on a common 
key, let us quote the AES (Advanced Encryption Standard) system, which 
is public, fast, and well discussed in the literature. 

Exchanging a key is quite simple when the two subjects can meet in a
safe place to physically exchange the key. Otherwise, they have to find a
safe channel of communication to send the key, but this leads us back to
the starting point.

In 1976 Diffie and Hellman proposed a method that, in principle, enables
people A (Alice) and B (Bob) to exchange safely a secret key, using
an unsafe channel. It is based on modulo $p$ arithmetic and on the different
times necessary to compute the discrete exponential

$$x \mapsto y := a^x \bmod p$$

and the discrete logarithm, that is, to solve the inverse problem of finding
$x \in \mathbb{Z}$ such that

$$a^x = y \bmod p.$$

In fact, on the one hand the modular power operation can be computed
with few multiplications. Writing $x$ in binary form, that is $x = \sum_{i=0}^{N} x_i 2^i$,
$x_i = 0$ or 1, it is easy to see that the computation of $a^x \bmod p$ resolves
in shifts and a number of multiplications of the order of the number of
bits of $x$. On the other hand, the best known algorithm to solve $a^x = y \bmod p$
needs 2^^^^ multiplications.^ For large N, the second operation cannot be 
performed in a few years even using very powerful computers. 
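The binary method for the modular power mentioned above can be sketched as follows. The name `mod_pow` is ours; in practice Python's built-in three-argument `pow` does the same job.

```python
def mod_pow(a, x, n):
    """Compute a^x mod n (x >= 0) by scanning the binary digits of x."""
    result = 1 % n
    base = a % n
    while x > 0:
        if x & 1:                  # current binary digit of x is 1
            result = result * base % n
        base = base * base % n     # a^(2^i) -> a^(2^(i+1))
        x >>= 1                    # shift to the next binary digit
    return result
```

Each binary digit of $x$ costs one squaring and at most one further multiplication, so the total number of multiplications is of the order of the number of bits of $x$.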

The procedure is as follows. Suppose that Alice and Bob want to exchange
a numerical key $k$ through a nonsafe public channel. First Alice
and Bob openly choose a prime number $p$ that is large enough and a
number $a$ between 2 and $p - 1$. Then Alice chooses a number $x$ that she
keeps to herself and openly sends Bob $C := a^x \bmod p$. Analogously, Bob
chooses a number $y$ that he, too, keeps to himself and openly sends Alice
$D := a^y \bmod p$. Alice and Bob are now able to build the common key by
combining their secret number with the datum received from the other
user: Alice computes

$$D^x \bmod p = a^{xy} \bmod p$$

and Bob computes

$$C^y \bmod p = a^{xy} \bmod p.$$

The two keys are identical. Nobody else will be able to recover $x$ or $y$
starting from $p$, $a$, $C$ and $D$, without inverting the modular exponential.
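The exchange can be simulated in a few lines. This is a toy sketch: the function name is ours, the secrets are drawn with the standard `secrets` module, and the prime used in the example ($2^{61} - 1$) is far too small for real use.

```python
import secrets


def diffie_hellman_demo(p, a):
    """Simulate one key exchange; return the keys computed by Alice and Bob."""
    x = secrets.randbelow(p - 3) + 2  # Alice's secret, in [2, p - 2]
    y = secrets.randbelow(p - 3) + 2  # Bob's secret, in [2, p - 2]
    C = pow(a, x, p)                  # sent openly by Alice
    D = pow(a, y, p)                  # sent openly by Bob
    key_alice = pow(D, x, p)          # D^x = a^(xy) mod p
    key_bob = pow(C, y, p)            # C^y = a^(xy) mod p
    return key_alice, key_bob
```

Calling `diffie_hellman_demo(2**61 - 1, 3)` returns two equal numbers, whatever secrets were drawn.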

In 1978 Rivest, Shamir, and Adleman proposed an algorithm that is
now used in many programs for the so-called public key cryptography. Let
us start with the following generalization of Fermat's theorem that is a
key point in the RSA cryptographic system (see, for example, Figure 3.7).

3.22 Theorem. Let $p$, $q$ be two distinct primes and let $n := pq$. Then for
all $a \in \mathbb{Z}$ and all integers $k \ge 0$,

$$a^{1 + k\phi(n)} \equiv a \pmod n,$$

$\phi(n)$ being the Euler's function, $\phi(n) := (p - 1)(q - 1)$.

^ Well, if $a^x < p$, then the answer is immediate.

Proof. If $a$ is not a multiple of $p$, by Fermat's theorem we have $a^{p-1} \equiv 1 \pmod p$,
hence

$$a^{1 + k\phi(n)} = a\,(a^{p-1})^{k(q-1)} \equiv a \pmod p.$$

If $a$ is a multiple of $p$, then $a^r \equiv 0 \equiv a \pmod p$ for all $r \ge 1$, in particular
$a^{1 + k\phi(n)} \equiv 0 \equiv a \pmod p$. In all cases we infer

$$a^{1 + k\phi(n)} \equiv a \pmod p.$$

Similarly

$$a^{1 + k\phi(n)} \equiv a \pmod q,$$

and, by the Chinese remainder theorem, $a^{1 + k\phi(n)} \equiv a \pmod n$. □

3.23 Corollary. Let $p$, $q$ be two distinct primes and let $n := pq$. Let $e$, $d$
be such that $ed \equiv 1 \pmod{\phi(n)}$. Then the maps

$$x \mapsto x^e \bmod n, \qquad y \mapsto y^d \bmod n$$

from $\mathbb{Z}_n$ into $\mathbb{Z}_n$ are each the inverse of the other.

Proof. In fact $ed = 1 + k\phi(n)$, hence, by Theorem 3.22

$$(x^e \bmod n)^d \equiv x^{ed} \equiv x \pmod n.$$

□

Assume that A (Alice) wants to communicate a secret to B (Bob). First
Bob does the following:

(i) he selects two primes $p$, $q$, and computes the modulus $n := pq$, and
the Euler's function $\phi(n) := (p - 1)(q - 1)$,

(ii) he selects a third prime $e$, not dividing $\phi(n)$, and computes $d$ such that $ed \equiv 1$
(mod $\phi(n)$), e.g., by the generalized Euclid algorithm,

(iii) he publishes as public key the pair $(n, e)$ and keeps as a private key,
his secret, the pair $(n, d)$.

Assume that Alice (or anybody else) wants to send a confidential message
to Bob. Firstly, she gets the public key $(n, e)$ of Bob, then codes the message
as a number $M < n$, and finally she sends Bob $C := M^e \bmod n$. Bob
is then able to recover $M$ by computing $C^d \bmod n$, according to Corollary 3.23.
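The three steps and the coding and decoding maps can be sketched with toy numbers. The function names are ours, the primes in the example are tiny and purely illustrative, and $d$ is obtained with the built-in `pow(e, -1, phi)` (Python 3.8 or later) instead of the generalized Euclid algorithm.

```python
from math import gcd


def make_keys(p, q, e):
    """Return (public_key, private_key) = ((n, e), (n, d)) for primes p != q."""
    n = p * q
    phi = (p - 1) * (q - 1)
    if gcd(e, phi) != 1:
        raise ValueError("e must be coprime to phi(n)")
    d = pow(e, -1, phi)  # d with e*d ≡ 1 (mod phi(n))
    return (n, e), (n, d)


def encrypt(message, public_key):
    """C := M^e mod n, for a message coded as a number M < n."""
    n, e = public_key
    return pow(message, e, n)


def decrypt(cipher, private_key):
    """M = C^d mod n."""
    n, d = private_key
    return pow(cipher, d, n)
```

With $p = 61$, $q = 53$ and $e = 17$ one gets $n = 3233$, $\phi(n) = 3120$ and $d = 2753$, and decoding inverts coding for every $M < n$.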

Besides the ability of Bob to decode the message, it is worth discussing 
the possibilities for an attacker to read the message, or, worse, to find the 
secret key (n, d). We make only a few remarks. 

o To decide if a given random number of around 256 digits is prime, Fer- 
mat’s test with base a = 2 is usually sufficient, even if not totally secure. 
As a by-product, Fermat’s test is fast, being based on modular powers. 
Recently, a fast algorithm of order 0{N'^) N = log2P was found that 
decides if a number p is prime or not. The algorithm does not exhibit a 
prime factor of p in case p is not prime, however.

o As we have already seen, computing $a^x \bmod n$ uses a number of multiplications
of order $O(b)$, $b$ being the number of bits of $x$. Assuming
that the public and the secret exponents are between 0 and $n$, we conclude
that the computational complexity of coding and of decoding the
message is of order $O(N)$, $N := \log_2 n$.

o Computing $n := pq$ requires one multiplication, while finding the factors
$p$, $q$ from $n$, while in principle feasible, requires too much work for
numbers $n = pq$ of, say, 256 bits. The sieve of Eratosthenes is too slow
requiring $2^{N/2}$ multiplications, $N = \log_2 n$, and even the better methods
of factorization based on the LLL algorithm of Lenstra, Lenstra and
Lovász, predict a number of multiplications of order
roughly 2^^ for numbers n of 256 bits. Of course, this estimate holds for 
products of primes randomly chosen, while for special products the time 
of factorization can be shorter. Concluding, the choice of the primes p 
and $q$ is somewhat critical, but generally speaking, choosing $p$, $q$ in a
random way with around 128 bits each, makes it practically infeasible
to find in a short time the factors $p$, $q$ from $pq$, and therefore impossible
to get $\phi(n)$ and $d$ from $n$ this way in a short lapse of time.

o The RSA algorithm is clearly less secure than the factorization of integers,
since it publishes also the number $e$. In fact a clever attack can be
made to find $d$ if $d$ is small enough.



3.24 Theorem (Wiener, 1981). Let $p$, $q$ be primes with $p < q < 2p$.
Let $n = pq$, and let $0 < e, d < \phi(n)$ be such that $ed \equiv 1 \pmod{\phi(n)}$. If

$$d < \frac{1}{\sqrt{6}}\, n^{1/4},$$

then one can find $d$ with an algorithm of computational complexity of
$O(N)$, $N := \log_2 n$.

Proof. We have $0 < ed - 1 = k\phi(n) < d\phi(n)$, from which we infer
$0 < k < d$ and g.c.d. $(k, d) = 1$. Moreover $p + q < 3\sqrt{n}$, therefore we can
infer

$$\Big|\frac{e}{n} - \frac{k}{d}\Big| = \frac{|de - kn|}{dn} = \frac{|de - k\phi(n) + k(\phi(n) - n)|}{dn} = \frac{|1 + k(\phi(n) - n)|}{dn} \le \frac{k}{d}\,\frac{n - \phi(n)}{n} < \frac{3}{\sqrt{n}}$$

and

$$\Big|\frac{e}{n} - \frac{k}{d}\Big| < \frac{1}{2d^2}$$

since $1/n^{1/4} < 1/(\sqrt{6}\,d)$. Therefore $k/d$ is one of the reduced continued
fractions of $e/n$, see Theorem 8.35. Since they can be all computed easily
with $O(N)$ complexity by Euclid's generalized algorithm, the proof is
concluded. □
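The proof suggests a concrete procedure, sketched here under our own naming: run through the reduced continued fractions $k/d$ of $e/n$ and accept the first candidate for which $(ed - 1)/k$ behaves like $\phi(n)$. The acceptance test used below (the number $n - \phi(n) + 1$ should equal $p + q$, so that $(p + q)^2 - 4n = (q - p)^2$ is a perfect square) is one possible choice and is not taken from the text. `math.isqrt` assumes Python 3.8 or later.

```python
from math import isqrt


def convergents(num, den):
    """Yield (numerator, denominator) of the reduced continued fractions of num/den."""
    h_prev, h = 0, 1  # numerators
    k_prev, k = 1, 0  # denominators
    while den != 0:
        q, r = divmod(num, den)  # one step of Euclid's algorithm
        h_prev, h = h, q * h + h_prev
        k_prev, k = k, q * k + k_prev
        yield h, k
        num, den = den, r


def wiener_attack(n, e):
    """Return the secret exponent d if it is small enough, otherwise None."""
    for k, d in convergents(e, n):
        if k == 0 or (e * d - 1) % k != 0:
            continue
        phi = (e * d - 1) // k     # candidate for phi(n)
        s = n - phi + 1            # would be p + q
        disc = s * s - 4 * n       # would be (q - p)^2
        if s > 0 and disc >= 0:
            t = isqrt(disc)
            if t * t == disc and (s + t) % 2 == 0:
                return d
    return None
```

For example, with $p = 10007$, $q = 10009$ and $d = 35$ the public exponent is $e = 37194875$, and the fraction $13/35$ appears among the reduced continued fractions of $e/n$.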
o A low exponent $e$, especially $e = 2, 3$, can be trouble, too, but the
published flaws are far from a total break. Choices of $e$ with around 17
bits are usually made.

o The RSA system yields a way to make confidential communications
from many to one. For a communication that involves two people, or two
computers, the AES encryption standard, or other algorithms based on a
common key, are preferred since they are by far faster than RSA. Thus
RSA is often used for the only purpose of transmitting a secret key; then
the actual communication is encrypted by the AES or similar algorithms.
In this respect, the confidentiality of the transmitted message by RSA
has to be analyzed, too. Despite 25 years of use of RSA, very few results
are known on this subject. The basic question, whether inverting the
RSA coding function is computationally equivalent to the factorization
of $n := pq$, seems today a largely open issue.



The RSA algorithm can be useful also for authentication purposes. 
Assume that Bob codes a given public message with his secret key. Then 
anyone else can decode the coded version of the original message by Bob’s 
public key, rediscovering the original message. In principle nobody can code 
the original message in such a way that the coded message, if decoded 
by Bob’s public key, agrees to the original, unless the coding was done 
by Bob's secret key. This way, Bob can authenticate himself. However,
the lack of mathematical evidence of the security of the RSA algorithm
for authentication purposes and several documented breaks on some of
the actual implementations confine the RSA digital signature scheme to
applications that need a mild form of authentication.



3.2 Combinatorics 



We recall that a set $X$ is said to be finite if there is a one-to-one correspondence
of $X$ with a subset of integers of the type $\{1, 2, \dots, n\}$. In this case,
the number $n$ of elements of $X$ is called the cardinality of $X$ and denoted
by $|X|$ or $\#X$. In other words, every finite set $X$ has $|X|$ elements and it
can be ordered by giving indices $1, 2, \dots, |X|$ to its elements.

Starting from one or several finite sets we can construct new sets either
by selecting or arranging some of their elements or by taking unions, intersections
or products. One refers to the procedures that allow computation
of the number of elements of these new sets as combinatorics. This is a
fascinating branch of mathematics with many applications, for instance,
in engineering and social sciences. Here we confine ourselves to a few basic
concepts.

3.2.1 Samples, mappings and subsets

Given n objects, how many different ways of selecting k objects from n do 
we have? In other words we want to compute the cardinality of the set of 
the selections of k objects out of n. The question is of course vague unless 
we specify 

o whether the objects are all of the same type or how many objects belong 
to each type, 

o the procedure according to which we make the choice, 
o how we count the selected configurations. 



a. Ordered samples and mappings 

3.25 Definition. A list, or an ordered sample, $\{x_j\}_{j=0,1,\dots,k-1}$ of size $k$
from a set $X$, an ordered $k$-sample in short, is an ordered selection of $k$
elements from $X$.

Two lists $\{x_j\}$ and $\{y_j\}$ are equal if $x_j = y_j$ $\forall j = 0, 1, \dots, k - 1$, that
is, if they contain the same elements arranged in the same order.

An ordered $k$-sample can be obtained by selecting its elements one
after the other in many ways: for instance, we make the selection of each
object from the entire population, so that the same element can be drawn
more than once (sampling with replacement), or, an element once chosen
is removed from the population (sampling without replacement). In the
first case we are using arrangements with repetitions, in the second case
arrangements without repetitions.

3.26 Arrangements without repetitions. What is the number of ordered
$k$-samples without replacement from a population of $n$ objects, called
also arrangements without repetitions or $k$-permutations of $n$ distinct objects?
We have $n$ choices for the first element, $n - 1$ for the second, ...,
$(n - k + 1)$ for the $k$-th. Consequently the number of ordered $k$-samples
without replacement from $n$ objects is

$$D^n_k := |\mathcal{D}_{n,k}| = n(n - 1)(n - 2) \cdots (n - k + 1), \qquad 1 \le k \le n.$$

For convenience we define also $D^n_0 := D^0_0 := 1$.

3.27 Permutations. An ordered $n$-sample without replacement from $n$ objects is called a
permutation. The set of permutations $\mathcal{P}_n$ of a set of $n$ elements has the
cardinality

$$P_n := |\mathcal{P}_n| = D^n_n = n(n - 1) \cdots 3 \cdot 2 \cdot 1 = n!.$$

We agree that the empty set has only one possible permutation, so that

$$P_0 := 1 =: 0!.$$

3.28 Arrangements with repetitions. In the case of ordered samples
with replacement, the number of choices for the first element of a list as
well as for the others is always $n$, thus the cardinality of the set of ordered
$k$-samples with replacement from $n$ objects, called also permutations with
repetitions of $k$ objects from $n$, is

$$R^n_k := |\mathcal{R}_{n,k}| = n^k, \qquad 1 \le k \le n.$$

We also set for convenience $R^n_0 := R^0_0 := 1$.

3.29 Maps. Ordered $k$-samples from $n$ elements with repetitions can be
of course identified with the maps $f : \{1, 2, \dots, k\} \to \{1, \dots, n\}$, since
every such map is defined by the list of its values

$$(f(1), f(2), \dots, f(k)).$$

From 3.28 the next proposition follows.

Proposition. Denote by $\mathcal{F}(X, Y)$ the set of all maps from $X$ to $Y$. If
$\#X = k$ and $\#Y = n$, then $\mathcal{F}(X, Y)$ has $n^k$ elements.

Example. Given $A \subset X$, the characteristic function of $A$ is defined by

$$\varphi_A(x) := \begin{cases} 1 & \text{if } x \in A, \\ 0 & \text{if } x \notin A. \end{cases}$$

The correspondence $A \subset X$ with the map $\varphi_A : X \to \{0, 1\}$ is clearly one-to-one.
Therefore subsets of $X$ are as many as the maps from $X$ into $\{0, 1\}$, that is $\#\mathcal{P}(X) = 2^{\#X}$.


3.30 Injective maps. Also $k$-samples without repetitions from $n$ objects
can be of course identified with maps, the injective maps from
$\{0, 1, \dots, k - 1\}$ to $\{0, 1, \dots, n - 1\}$. Consequently from 3.26 and 3.27 we
get

Proposition. Denote by $\mathcal{I}(X, Y)$ the set of maps from $X$ to $Y$ that are
injective. If $\#X = k$ and $\#Y = n$, then

$$\#\mathcal{I}(X, Y) = D^n_k = n(n - 1)(n - 2) \cdots (n - k + 1).$$

In particular the number of bijective maps from $X$ into itself is

$$\#\mathcal{I}(X, X) = k!$$
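These counts can be verified by brute-force enumeration with the standard `itertools` module. In the sketch below a map is represented by the tuple of its values; the function names are ours.

```python
from itertools import permutations, product


def all_maps(k, n):
    """All maps {1..k} -> {1..n}, each given by the tuple of its values."""
    return list(product(range(1, n + 1), repeat=k))


def injective_maps(k, n):
    """All injective maps {1..k} -> {1..n} (ordered k-samples without repetitions)."""
    return list(permutations(range(1, n + 1), k))
```

For instance there are $4^3 = 64$ maps from a set of 3 elements to a set of 4 elements, $5 \cdot 4 \cdot 3 = 60$ injective maps from 3 elements to 5, and $4! = 24$ bijections of a set of 4 elements.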
b. Nonordered samples and subsets

We often sample k elements from n objects, but the actual order of the 
elements in the resulting arrangement is unimportant, that is, two samples 
may be considered equal if they contain the same elements, irrespectively 
of the order. We then speak of nonordered samples. 



3.31 Nonordered samples without replacement. As we have seen,
the number of ordered $k$-samples without replacement from $n$ objects is $D^n_k$
(see, for example, 3.26). Since we have $k!$ different ordered samples of the
same $k$ objects, the number of nonordered $k$-samples without replacement
is

$$C^n_k := \frac{D^n_k}{k!} = \frac{n(n - 1) \cdots (n - k + 1)}{k!}, \qquad 1 \le k \le n.$$

We also set for convenience $C^n_0 = C^0_0 = 1$. Nonordered $k$-samples without
replacement from $n$ objects are also called $k$-combinations of $n$ distinct
objects.

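The binomial coefficients are defined for all $\alpha \in \mathbb{R}$ and $k \in \mathbb{N}$ as

$$\binom{\alpha}{k} := \frac{\alpha(\alpha - 1)(\alpha - 2) \cdots (\alpha - k + 1)}{k!}$$

and the following formulas hold

$$C^n_k = \binom{n}{k} = \frac{n!}{k!\,(n - k)!} = \binom{n}{n - k} \qquad \forall k, n \in \mathbb{N},\ k \le n,$$

$$\binom{\alpha}{0} = 1 \qquad \forall \alpha \in \mathbb{R},$$

$$\binom{n}{k} = \frac{n}{k} \binom{n - 1}{k - 1} \qquad \forall k = 1, 2, \dots, n,$$

$$\binom{n}{k} = \binom{n - 1}{k} + \binom{n - 1}{k - 1} \qquad \forall k,\ 1 \le k \le n.$$

The last formula is called Pascal's formula.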



3.32 Subsets of finite sets. A nonordered $k$-sample without replacement
from a set $X$ of $n$ elements is merely a subset $A \subset X$ of $k$ elements.
Therefore, from 3.31 we get

Proposition. Let $\#X = n$. The number of subsets $A$ of $X$ with $k$ elements is

$$\#\{\, A \in \mathcal{P}(X) \mid |A| = k \,\} = C^n_k = \binom{n}{k}.$$

In particular, the number of subsets of $X$, including the empty set, i.e.,
the cardinality of $\mathcal{P}(X)$, is

$$1 + \sum_{k=1}^{n} \binom{n}{k} = \sum_{k=0}^{n} \binom{n}{k} = (1 + 1)^n = 2^n,$$

on account of the binomial theorem.
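The count of subsets can be checked by enumeration. The sketch below lists the subsets of $\{1, \dots, n\}$ with $k$ elements using `itertools.combinations` and compares their number with `math.comb` (Python 3.8 or later); the function name is ours.

```python
from itertools import combinations
from math import comb


def subsets_of_size(n, k):
    """All subsets of {1, ..., n} with exactly k elements, as sorted tuples."""
    return list(combinations(range(1, n + 1), k))


def number_of_subsets(n):
    """Total number of subsets of {1, ..., n}, summed over all sizes k."""
    return sum(len(subsets_of_size(n, k)) for k in range(n + 1))
```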



c. Ordered lists 

3.33 Definition. Let $X$ be a set ordered by $\le$. A monotonic $k$-list
$\{x_i\} \subset X$ is also called an ordered list.

3.34 Increasing lists. Let $X$ be an ordered finite set. It is not restrictive
to assume that $X = \{1, 2, \dots, n\}$. Let us compute the number of the
increasing lists of $k$ numbers between 1 and $n$, i.e., of the type

$$h_1, h_2, \dots, h_k, \qquad h_i < h_{i+1} \quad \forall i = 1, \dots, k - 1.$$

Thinking of $k$-lists from $n$ objects as maps from $\{1, \dots, k\}$ to $\{1, \dots, n\}$,
it is easy to see that increasing ordered $k$-lists correspond to the strictly increasing
maps. Since there is a unique way of listing $k$ numbers in an increasing
order, increasing ordered $k$-lists are equal in number to the subsets of $k$ elements of
$X$, that is, $C^n_k = \binom{n}{k}$.




3.35 Nondecreasing lists. Let us compute the number $L^n_k$ of nondecreasing
ordered lists of $k$ objects from $n$, i.e., of $k$-tuples $\{h_i\}$ with
$h_i \le h_{i+1}$ $\forall i = 1, \dots, k - 1$. This time the elements of a list $\{h_i\}$ are not
necessarily distinct. However, we can associate in a one-to-one fashion to
each such nondecreasing $k$-list from $n$ objects a strictly increasing $k$-list
from $n + k - 1$ objects by means of the map $\phi$ defined as

$$\phi(h_1, h_2, \dots, h_k) := (h_1, h_2 + 1, h_3 + 2, \dots, h_k + (k - 1)).$$

It is easily seen that $\phi$ maps the nondecreasing $k$-lists from $\{1, \dots, n\}$ onto
the increasing $k$-lists from $\{1, \dots, n + k - 1\}$ and that $\phi$ is one-to-one. Thus

$$L^n_k = C^{n+k-1}_k = \binom{n + k - 1}{k}.$$

Thinking of $k$-lists from $n$ objects as maps from $\{1, \dots, k\}$ to $\{1, \dots, n\}$, it
is easy to see that nondecreasing $k$-lists correspond to nondecreasing maps
from $\{1, \dots, k\}$ to $\{1, \dots, n\}$.

Figure 3.8. The frontispieces of the Ars conjectandi by Jacob Bernoulli (1654-1705) and
a modern treatise about discrete mathematics.



3.36 Nonordered samples with replacement. Since the nonordered
$k$-samples with replacement from $n$ objects are clearly as many as the
nondecreasing $k$-lists from the same objects, we conclude that the number
of nonordered $k$-samples with repetitions, also called $k$-combinations with
repetitions from $n$, is

$$\binom{n + k - 1}{k} = (-1)^k \binom{-n}{k}, \qquad 1 \le k \le n. \tag{3.11}$$
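The correspondence between nondecreasing lists and increasing lists can be checked by enumeration. In the sketch below `shift` (our name) implements the map $(h_1, \dots, h_k) \mapsto (h_1, h_2 + 1, \dots, h_k + k - 1)$, and the lists are generated with `itertools`.

```python
from itertools import combinations, combinations_with_replacement
from math import comb


def shift(h):
    """Map (h_1, ..., h_k) to (h_1, h_2 + 1, ..., h_k + k - 1)."""
    return tuple(x + i for i, x in enumerate(h))


def nondecreasing_lists(n, k):
    """All nondecreasing k-lists with entries in {1, ..., n}."""
    return list(combinations_with_replacement(range(1, n + 1), k))


def increasing_lists(n, k):
    """All strictly increasing k-lists with entries in {1, ..., n}."""
    return list(combinations(range(1, n + 1), k))
```

For $n = 4$ and $k = 3$ the 20 nondecreasing lists are mapped bijectively onto the 20 increasing lists from $\{1, \dots, 6\}$.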



d. The formula of inclusion and exclusion 

Let $A, B \subset \Omega$ be finite and disjoint subsets of $\Omega$; then we have
$|A \cup B| = |A| + |B|$, and, more generally, in the case $A \cap B \ne \emptyset$,

$$|A \cup B| = |A| + |B| - |A \cap B|. \tag{3.12}$$

The formula (3.12) extends to the case of more than two subsets and
is very useful in computing for example the probability of incompatible
events. The reader will check that in the case of three subsets we have

$$|A_1 \cup A_2 \cup A_3| = |A_1| + |A_2| + |A_3| - |A_1 \cap A_2| - |A_2 \cap A_3| - |A_1 \cap A_3| + |A_1 \cap A_2 \cap A_3|.$$

Let $A_1, A_2, \dots, A_n$ be finite subsets of $\Omega$ and $1 \le k \le n$. Since the
intersection of $k$ of the sets $A_1, A_2, \dots, A_n$ is commutative, we index the
intersection $A_{i_1} \cap A_{i_2} \cap \cdots \cap A_{i_k}$ by an increasing list of indices
$i_1 < i_2 < \cdots < i_k$. It is not difficult to show that we have

3.37 Proposition (Inclusion-exclusion formula). Let $A_1, A_2, \dots, A_n$
be finite subsets of $\Omega$. Then

$$|A_1 \cup A_2 \cup \cdots \cup A_n| = \sum_{k=1}^{n} (-1)^{k+1} \sum_{1 \le i_1 < i_2 < \cdots < i_k \le n} |A_{i_1} \cap A_{i_2} \cap \cdots \cap A_{i_k}|.$$
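The formula can be evaluated literally, summing over all increasing lists of indices. The sketch below does so for a list of Python sets; the function name is ours.

```python
from itertools import combinations


def union_size(sets):
    """Cardinality of the union of the given sets by inclusion-exclusion."""
    total = 0
    for k in range(1, len(sets) + 1):
        sign = (-1) ** (k + 1)
        # every choice of k of the sets corresponds to i_1 < ... < i_k
        for chosen in combinations(sets, k):
            total += sign * len(set.intersection(*chosen))
    return total
```

For example, taking the multiples of 2, of 3 and of 5 among $0, 1, \dots, 29$ gives $15 + 10 + 6 - 5 - 3 - 2 + 1 = 22$.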



e. Surjective maps 

Denote by $S(X, Y)$ the family of surjective maps from $X$ into $Y$. Trivially
$|S(X, Y)| = 0$ if $|X| < |Y|$, while in the case $|X| \ge |Y|$ we have

3.38 Proposition. Let $X = \{1, 2, \dots, k\}$, $Y = \{1, 2, \dots, n\}$ and $n \le k$.
Then

$$|S(X, Y)| = \sum_{j=0}^{n} (-1)^j \binom{n}{j} (n-j)^k.$$
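Before turning to the proof, the formula of Proposition 3.38 can be compared with a direct enumeration of all maps for small $k$ and $n$. This is only a sanity check under the stated definitions; the function names are illustrative.

```python
from itertools import product
from math import comb


def surjections_formula(k, n):
    """Alternating sum of Proposition 3.38."""
    return sum((-1) ** j * comb(n, j) * (n - j) ** k for j in range(n + 1))


def surjections_brute_force(k, n):
    """Count maps f: {1..k} -> {1..n} whose range is all of {1..n}."""
    return sum(
        1
        for f in product(range(1, n + 1), repeat=k)  # f as the list of its values
        if len(set(f)) == n
    )


for k, n in [(4, 3), (5, 5), (2, 3)]:
    print(k, n, surjections_formula(k, n), surjections_brute_force(k, n))
```

For $k < n$ the alternating sum evaluates to 0, in agreement with the remark that there are no surjections in that case.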

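Proof. Let $\mathcal{F}(X, Y)$ be the family of all maps $f : X \to Y$ and, for $j =
1, \dots, n$, let $A_j$ denote the family of maps $f : X \to Y$ the ranges of which do
not contain $j$. Trivially

$$\mathcal{F}(X, Y) = S(X, Y) \cup \bigcup_{j=1}^{n} A_j,$$

where $S(X, Y)$ is disjoint from $\bigcup_j A_j$; hence the inclusion-exclusion formula yields

$$|S(X, Y)| = |\mathcal{F}(X, Y)| - |A_1 \cup \dots \cup A_n|
= n^k + \sum_{j=1}^{n} (-1)^j \sum_{1 \le i_1 < \dots < i_j \le n} |A_{i_1} \cap \dots \cap A_{i_j}|. \tag{3.13}$$

For every $j$-tuple $(i_1, \dots, i_j)$ with distinct elements, $A_{i_1} \cap \dots \cap A_{i_j}$ is the
family of maps whose ranges avoid $i_1, \dots, i_j$, that is, the maps from $X$ into a
set of $n - j$ elements; consequently

$$|A_{i_1} \cap \dots \cap A_{i_j}| = (n-j)^k$$

and

$$\sum_{1 \le i_1 < \dots < i_j \le n} |A_{i_1} \cap \dots \cap A_{i_j}| = (n-j)^k \sum_{1 \le i_1 < \dots < i_j \le n} 1
= (n-j)^k \binom{n}{j}. \tag{3.14}$$

The result then follows from (3.13) and (3.14).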



□

  $k$                     0    1     2     3     4      5
  $n^k$                   1    5    25   125   625   3125
  $D^n_k$                 1    5    20    60   120    120
  $C^n_k$                 1    5    10    10     5      1
  $\binom{n+k-1}{k}$      1    5    15    35    70    126

Figure 3.9. The values of $n^k$, $D^n_k$, $C^n_k$ and $\binom{n+k-1}{k}$ for $n = 5$.



3.2.2 Drawings 

3.39 Drawing in succession. The drawings in succession of $k$ elements
from $n$ with replacement are as many as the ordered $k$-samples with
replacement from $n$ objects, i.e., $n^k$ (see 3.28), while the drawings in
succession of $k$ elements from $n$ without replacement are as many as the
ordered $k$-samples without replacement from $n$ objects, i.e.,
$D^n_k = n!/(n-k)!$ (see 3.26).

3.40 Simultaneous drawings. Making a simultaneous drawing of $k$ objects
from $n$ is clearly the same as choosing a subset of $k$ elements from
$n$. Therefore the simultaneous drawings of $k$ elements from $n$ are as many
as the subsets with $k$ elements in a set with $n$ elements, that is, $C^n_k = \binom{n}{k}$
(see, for example, 3.32).

3.41 Simultaneous drawings with repetitions. Suppose instead we
have a population of infinitely many elements of type 1, infinitely many
elements of type 2, ..., infinitely many elements of type $n$. The simultaneous
drawings of $k$ elements from this population are as many as the
nonordered $k$-samples with replacement, that is, $\binom{n+k-1}{k}$. The same
result holds provided the initial population has at least $k$ elements of each
type.

The table in Figure 3.9 shows how results can be different. 
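The four counts discussed in 3.39-3.41 can be tabulated with a few lines of code. The sketch below assumes the parameters $n = 5$ and $k = 0, \dots, 5$ purely for illustration; `math.perm` requires Python 3.8 or later, and the row labels are informal names, not notation from the text.

```python
from math import comb, perm


def counting_table(n, ks):
    """Numbers of drawings of k elements from n for the four models."""
    return {
        "ordered, with replacement": [n ** k for k in ks],
        "ordered, without replacement": [perm(n, k) for k in ks],
        "simultaneous": [comb(n, k) for k in ks],
        "simultaneous, with repetitions": [comb(n + k - 1, k) for k in ks],
    }


for label, row in counting_table(5, range(6)).items():
    print(f"{label:32s}", row)
```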



3.2.3 Location problems 

How many ways do we have of placing fc balls into n cells? Again the 
question is quite vague unless we specify how to distinguish the resulting 
arrangements and the rules to fill the cells. In this respect we look at 

o whether the balls are distinct, 
o how many balls can be placed into a cell. 

These kinds of problems arise typically in statistical mechanics. Several
situations left vague by the previous description are quite relevant. The
next examples describe some of them: they refer to the case of distinct
cells.

3.42 Maxwell-Boltzmann statistics. The balls have a label which
renders them distinct; moreover there is no limit to the number of balls
that can be placed in one of the $n$ cells. In this case the number of ways of
placing $k$ balls into $n$ cells equals the number of maps $f : \{1, 2, \dots, k\} \to
\{1, 2, \dots, n\}$, that is, $n^k$.

If we instead require that at most one of the distinct balls can be placed in a
cell, we have as many possibilities as there are injective maps $f : \{1, 2, \dots, k\} \to
\{1, 2, \dots, n\}$, that is, $D^n_k = n!/(n-k)!$, $1 \le k \le n$.



3.43 Bose-Einstein statistics. The $k$ balls are all black, and there is
no limit on the number of balls we can place in a cell. In this case each
arrangement can be regarded as a sequence of black balls separated by white
balls, which represent the walls between cells. We therefore have as many
possibilities as the number of the subsets of $n - 1$ elements in a set of
$n - 1 + k$ elements, i.e.,

$$\binom{n+k-1}{n-1} = \binom{n+k-1}{k}.$$

Another way of thinking is that any such arrangement can be regarded as a
nondecreasing map from $\{1, \dots, k\}$ into $\{1, \dots, n\}$.

3.44 Fermi-Dirac statistics. The $k$ balls are nondistinct, and we cannot
place more than one ball in each of the $n$ cells. In this case $1 \le k \le n$
and the possibilities are as many as the strictly increasing maps from
$\{1, \dots, k\}$ into $\{1, \dots, n\}$, i.e., the subsets of $k$ elements in $\{1, 2, \dots, n\}$,
that is, $\binom{n}{k}$.
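The three statistics in 3.42-3.44 can be compared by listing the arrangements explicitly for small $k$ and $n$. In this sketch an arrangement is encoded, as an assumption of the example, by the list of cells occupied by the balls; the function names are illustrative.

```python
from itertools import combinations, combinations_with_replacement, product
from math import comb


def maxwell_boltzmann(n, k):
    """Distinct balls, no limit per cell: all maps {1..k} -> {1..n}."""
    return list(product(range(1, n + 1), repeat=k))


def bose_einstein(n, k):
    """Nondistinct balls, no limit per cell: nondecreasing lists of cells."""
    return list(combinations_with_replacement(range(1, n + 1), k))


def fermi_dirac(n, k):
    """Nondistinct balls, at most one per cell: k-subsets of the cells."""
    return list(combinations(range(1, n + 1), k))


n, k = 3, 2
print(len(maxwell_boltzmann(n, k)), n ** k)
print(len(bose_einstein(n, k)), comb(n + k - 1, k))
print(len(fermi_dirac(n, k)), comb(n, k))
```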

3.45 ¶ A physical system consists of some identical particles. The total energy of
the system is = const > 0. Each particle may possess a level of energy $kE_0$,
$k = 0, 1, 2, 3, 4$, and a particle of energy $kE_0$ may occupy one of the $k^2 + 1$ states
corresponding to this energetic level. How many different configurations, according to
the energetic state of the particles, can the system assume? Answer the same question
assuming that (a) at the energy level $kE_0$ there are $2(k^2 + 1)$ energetic states, (b) two
particles are not allowed to occupy the same energetic state.



3.46 Example. How many lists of $k$ nonnegative integers exist with sum $n$? In other
words, what is the cardinality of the set

$$\{(x_1, \dots, x_k) \mid x_i \in \mathbb{N},\ x_1 + \dots + x_k = n\}\,?$$

Interpret $x_1, \dots, x_k$ as the number of nondistinct balls placed respectively in the cells
$1, \dots, k$. Then the initial problem reads as: in how many ways can $n$ nondistinct balls
be placed in $k$ cells? The answer is

$$\binom{n+k-1}{n} = \binom{n+k-1}{k-1}.$$

The table in Figure 3.16 at the end of this chapter summarizes the
different models of counting we have discussed in this section.

Figure 3.10. The frontispieces of the Doctrine of Chances by Abraham de Moivre (1667-
1754) and a modern treatise on probability.



3.2.4 The hypergeometric and multinomial 
distributions 

The birth of probability is dated back to Blaise Pascal (1623-1662) and to
his correspondence with Pierre de Fermat (1601-1665) about a number of
questions connected to the games of cards posed by the Chevalier de Méré,
who was a dogged gambler with mathematical ambitions. The first published
treatise, De Ratiociniis in ludo aleae, appeared in 1657 and is due to
Christiaan Huygens (1629-1695); it was followed by the Ars conjectandi of 1713
by Jacob Bernoulli (1654-1705) and by The Doctrine of Chances of 1718
by Abraham de Moivre (1667-1754).

There are several definitions of probability. The first, and for this
reason it is referred to as classical, is due to Blaise Pascal (1623-1662). The
probability is the ratio between the favorable events and all possible events,
provided all events are equiprobable. It is convenient to imagine the events
as subsets $A$ of a set $\Omega$ of the possible cases, and it is usual to assign
probability 1 to the certain event $\Omega$. The classical probability of the event
$A$ is then

$$P(A) := \frac{|A|}{|\Omega|},$$

this way reducing to a problem of counting.

3.47 The hypergeometric distribution. In this context a typical problem
is the following. We are given a set $X$ of $N$ distinct balls: $K$ of them
are white and $N - K$ black. We simultaneously draw $n$ balls from $X$. What
is the probability for exactly $k$ of them to be white?

We assume that the drawings are random, that is, all outcomes are
equally probable. Therefore the probability is the ratio between the
favorable and possible events, the possible events being the drawings of $n$ balls
from $N$, i.e., $\binom{N}{n}$.

Denote by $A_k$ the set of simultaneous drawings that contain exactly $k$
white balls, $k \le n$. Clearly $|A_k| = 0$ either if $k > K$, since in this case
there are not enough white balls, or if $n - k > N - K$, as in this case
there are not enough black balls to produce an event with $k$ white balls.
If $\max(0, n - N + K) \le k \le \min(n, K)$, we instead have $|A_k| \ne 0$.

Given one such $k$, the number of simultaneous drawings of $k$ white balls
from $K$ white balls is $\binom{K}{k}$ and, for each of them, there are $\binom{N-K}{n-k}$ different
ways of choosing $n - k$ black balls. Therefore

$$|A_k| = \begin{cases} \binom{K}{k} \binom{N-K}{n-k} & \text{if } \max(0, n - N + K) \le k \le \min(n, K), \\ 0 & \text{otherwise,} \end{cases}$$

thus the probability of drawing $n$ balls with exactly $k$ white balls in the
trial from our set $X$ is

$$B(N, K, n)(k) := \frac{|A_k|}{\binom{N}{n}} = \frac{\binom{K}{k} \binom{N-K}{n-k}}{\binom{N}{n}} \qquad \text{for } \max(0, n - N + K) \le k \le \min(n, K). \tag{3.15}$$

Formula (3.15) is called the hypergeometric distribution.
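A minimal sketch of (3.15) in exact rational arithmetic follows; the function name and the sample parameters are assumptions made for illustration. Summing the values over all admissible $k$ gives 1, because the sets $A_k$ partition the possible drawings.

```python
from fractions import Fraction
from math import comb


def hypergeometric(N, K, n, k):
    """Probability of exactly k white balls among n drawn from N, K of them white."""
    if k < max(0, n - N + K) or k > min(n, K):
        return Fraction(0)
    return Fraction(comb(K, k) * comb(N - K, n - k), comb(N, n))


N, K, n = 10, 4, 3
for k in range(n + 1):
    print(k, hypergeometric(N, K, n, k))
print(sum(hypergeometric(N, K, n, k) for k in range(n + 1)))
```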

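Notice that the sets $A_k$ are pairwise disjoint and their union yields all
possible drawings. Therefore we have

3.48 Proposition (Vandermonde formula). Let $n \in \mathbb{N}$; then

$$\sum_{k=\max(0,\, n-N+K)}^{\min(n, K)} \binom{K}{k} \binom{N-K}{n-k} = \binom{N}{n}.$$

Proof. In fact

$$\binom{N}{n} = \sum_{k} |A_k| = \sum_{k=\max(0,\, n-N+K)}^{\min(n, K)} \binom{K}{k} \binom{N-K}{n-k}.$$

□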



3.49 Write a proof of Vandermonde's formula using the identity $(1 + a)^N = (1 + a)^K (1 + a)^{N-K}$.



3.50 ¶ Quality inspection. In an industrial production of 1000 items, 2% of them
are defective. Choosing at random 25 items, what is the probability of finding two or
more defective items?

3.51 The multinomial distribution. Let $\Omega$ be a set; a partition of $\Omega$
is a decomposition of $\Omega$ in pairwise disjoint sets $A_1, A_2, \dots, A_p$,

$$\Omega = \bigcup_{i=1}^{p} A_i, \qquad A_i \cap A_j = \emptyset \ \text{ for } i \ne j.$$

Let $n := |\Omega|$ and $k_i := |A_i|$, $i = 1, \dots, p$. Of course $k_1 + \dots + k_p = n$.
Denote by $C(k_1, k_2, \dots, k_p)$ the number of possible decompositions of $\Omega$ in
$p$ subsets having respectively $k_1, k_2, \dots, k_p$ elements with $k_1 + k_2 + \dots +
k_p = n$. We then have

$$C(k_1, k_2, \dots, k_p) = \frac{n!}{k_1!\, k_2! \cdots k_p!}.$$

In fact, we have $\binom{n}{k_1}$ different ways of choosing $k_1$ elements of $\Omega$, then
$\binom{n-k_1}{k_2}$ ways of choosing a further $k_2$ elements from $\Omega$, ..., and, finally,
$\binom{n-(k_1+k_2+\dots+k_{p-1})}{k_p}$ ways of choosing the remaining $k_p$ elements of $\Omega$.
Therefore

$$C(k_1, k_2, \dots, k_p)
= \frac{n!}{k_1!\,(n-k_1)!} \cdot \frac{(n-k_1)!}{k_2!\,(n-k_1-k_2)!} \cdots \frac{(n-(k_1+k_2+\dots+k_{p-1}))!}{k_p!\, 0!}
= \frac{n!}{k_1!\, k_2! \cdots k_p!}.$$
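The telescoping product in 3.51 can be mirrored in code: choose the blocks one after another and compare with the closed form. This is a sketch with hypothetical function names; `math.prod` requires Python 3.8 or later.

```python
from math import comb, factorial, prod


def multinomial_sequential(ks):
    """Product of binomials: choose k_1 elements, then k_2 of the rest, and so on."""
    remaining = sum(ks)
    total = 1
    for k in ks:
        total *= comb(remaining, k)
        remaining -= k
    return total


def multinomial_closed_form(ks):
    """n! / (k_1! k_2! ... k_p!) with n = k_1 + ... + k_p."""
    return factorial(sum(ks)) // prod(factorial(k) for k in ks)


ks = (2, 3, 1)
print(multinomial_sequential(ks), multinomial_closed_form(ks))
```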



3.52 52 cards are distributed to four players. How many hands are possible? 



3.3 Infinity 

3.3.1 The mathematical analysis of infinity 

Already Galileo Galilei (1564-1642) remarked that the squares of natural
numbers are as many as the natural numbers themselves. In Discorsi
intorno a due nuove scienze he wrote

Interrogando io . . . quanti siano i numeri quadrati, si può con
verità rispondere, loro esser tanti quanti sono le proprie radici,
avvenga che ogni quadrato ha la sua radice, ogni radice il suo
quadrato, né quadrato alcuno ha più d'una sola radice, né radice
alcuna più d'un quadrato solo.^

^ Asked . . . how many are the squared numbers, one can verily say they are as many
as the square roots, in fact every square number has its square root and every square
root its square, nor does any square have more than one square root, nor any square
root more than one square.

Figure 3.11. The frontispieces of the Discorsi intorno a due nuove scienze by Galileo
Galilei (1564-1642) and a book by Georg Cantor (1845-1918) about infinity.



For a long time the only notion of infinity really accepted by mathematicians
was Aristotle's notion of potential infinity, in the sense of never
ending: the sequence of natural numbers 0, 1, and so on is an example of
potential infinity. The use of actual infinity, by contrast, was either avoided
or regarded as a source of contradictions, in accordance with the Greek
mathematical and philosophical tradition: infinitum actu non datur.^

^ Actual infinity is not given.

But the development of mathematics, especially in the eighteenth and
nineteenth centuries, led mathematicians to confront themselves not only
with the idea of potential infinity in the sense of infinite processes as in
the infinitesimal calculus, but also with the need of understanding the
structure of infinite sets.



a. Cardinality

At the end of the nineteenth century Georg Cantor (1845-1918), in a series
of papers, set the foundations of the theory of sets and, in particular,
analyzed the concept of infinity on the basis of the principle of one-to-one
correspondence.

3.53 Definition. Two sets $A$ and $B$ are said to be equivalent or to have
the same power or the same cardinality, and we write card $A$ = card $B$ or
$A \sim B$, if and only if they are in a one-to-one correspondence with each
other.

It is easily seen that $\sim$ is an equivalence relation, i.e., it is

(i) Reflexive. $A \sim A$,

(ii) Symmetric. $A \sim B$ if and only if $B \sim A$,

(iii) Transitive. If $A \sim B$ and $B \sim C$, then $A \sim C$.

The equivalence classes are called the cardinals: if $A$ belongs to the
equivalence class $\alpha$, we say that $\alpha$ is the cardinality of $A$ and we write $\alpha := |A|$
or $\alpha :=$ card $A$; and existence of a cardinal $\alpha$ means the existence of a set
$A$ with card $A = \alpha$.

Sets $A$ that have the same power as $\{0, 1, 2, 3, \dots, n - 1\}$ are said to
have cardinality $n$; the empty set has cardinality 0 by definition. This way
natural numbers become cardinals: they are called finite cardinals; all others
are called transfinite cardinals.

If $A$ and $B$ are disjoint sets of cardinality $\alpha$ and $\beta$, the cardinality of
$A \cup B$ and of $A \times B$ depends only on $\alpha$ and $\beta$ and is denoted by $\alpha + \beta$
and $\alpha\beta$; in particular, if $\alpha = \alpha_1$ and $\beta = \beta_1$ then $\alpha + \beta = \alpha_1 + \beta_1$. The
cardinality $\alpha + \beta$ agrees with the ordinary sum of integers if $\alpha$ and $\beta$ are
finite cardinals.

If $A \ne \emptyset$, the set of mappings from $A$ into $B$ is denoted by $B^A$ and
its cardinality by $\beta^\alpha$. If $\alpha$ and $\beta$ are finite, then $\beta^\alpha$ is the ordinary power
of integers, see Section 3.2.

The set of naturals $\mathbb{N}$ is infinite and its cardinality is denoted by $\aleph_0$
(aleph, $\aleph$, is the first letter of the Hebrew alphabet). A set that has
cardinality $\aleph_0$, that is, a set in one-to-one correspondence with $\mathbb{N}$, is called
denumerable or countable.

Of course a set has cardinality $\aleph_0$ if and only if it is possible to
enumerate it.

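3.54 Show that

(i) card $\{2n \mid n \in \mathbb{N}\}$ = card $\{2n + 1 \mid n \in \mathbb{N}\} = \aleph_0$,

(ii) card $\{n \in \mathbb{N} \mid n \ge \bar{n}\} = \aleph_0$,

(iii) $n + \aleph_0 =$ card $\big(\{0, 1, \dots, n - 1\} \cup \{n, n + 1, \dots\}\big) = \aleph_0$,

(iv) $\aleph_0 + \aleph_0 = \aleph_0$,

(v) $n \aleph_0 = \aleph_0$ for all $n = 1, 2, 3, \dots$

[Hint: Show a bijection between the sets involved. To prove (v) notice that a bijection
$\{0, 1, \dots, n - 1\} \times \mathbb{N} \to \mathbb{N}$ is given by $(i, k) \to i + nk$.]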
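The bijection in the hint is just division with remainder, which can be made explicit as follows. The sketch checks the map on a finite block only, which illustrates but of course does not prove the statement; the function names are illustrative.

```python
def pair_to_natural(i, k, n):
    """The map (i, k) -> i + n k from {0, ..., n-1} x N to N."""
    return i + n * k


def natural_to_pair(m, n):
    """Inverse map: quotient and remainder of m modulo n."""
    k, i = divmod(m, n)
    return i, k


n = 3
images = sorted(pair_to_natural(i, k, n) for i in range(n) for k in range(10))
print(images == list(range(30)))
print(all(pair_to_natural(*natural_to_pair(m, n), n) == m for m in range(100)))
```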

3.55 Definition. Let $\alpha$, $\beta$ be cardinals. We say that

(i) $\alpha \le \beta$ if we can find sets $A$ and $B$ such that $A \subset B$, card $A = \alpha$ and
card $B = \beta$.

(ii) $\alpha < \beta$ if $\alpha \le \beta$ and $\alpha \ne \beta$.

Trivially card $A \le$ card $B$ if $A \subset B$. If $A$ is infinite and $B \subset A$ is finite,
obviously, $A \setminus B$ is infinite. Consequently we can inductively choose $a_1 \in A$,
$a_2 \in A \setminus \{a_1\}$, $a_3 \in A \setminus \{a_1, a_2\}$, and so on; therefore $A$ contains a subset
that has the same power as $\mathbb{N}$. We conclude

3.56 Proposition. A set $A$ is infinite if and only if it contains a
denumerable subset. A cardinal $\alpha$ is transfinite if and only if $\alpha \ge \aleph_0$.

Figure 3.12. Two pages from two of Cantor's papers about infinity that appeared
respectively in 1895 and 1897 in Mathematische Annalen.



If $A$ is infinite, according to Proposition 3.56, we can write

$$A = A_1 \cup A_2 \quad \text{with card } A_1 = \aleph_0 \text{ and } A_1 \cap A_2 = \emptyset.$$

Since $A_1$ has the same power as a strictly included subset $A_1' \subset A_1$, $A$ has the same
cardinality as its proper subset $A_1' \cup A_2$, hence we can state

3.57 Proposition. A set is infinite if and only if it has the same power
as one of its proper subsets.



b. Cantor-Bernstein theorem 

In principle two cardinals need not be comparable. However, the following
theorem states that $\alpha = \beta$ if and only if $\alpha \le \beta$ and $\beta \le \alpha$.

3.58 Theorem (Cantor-Bernstein). If A is equivalent to a subset of 
B and B is equivalent to a subset of A, then A and B are equivalent. 

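Proof. Let $h : A \to B_1 \subset B$ and $k : B \to A_1 \subset A$ be the one-to-one correspondences
between $A$ and $B_1 := h(A) \subset B$ and between $B$ and $A_1 := k(B) \subset A$, and let $A_2 :=
k(B_1)$. Writing $E \sim F$ for card $E$ = card $F$, by assumption $A \sim B_1$, and, by construction,
$B_1 \sim A_2$; hence $A_2 \sim A$. The map $\varphi := k \circ h : A \to A_2$ is one-to-one; set $A_0 := A$ and

$$A_{n+2} := \varphi(A_n), \qquad \forall n \ge 1.$$

Since $A_2 \subset A_1 \subset A_0$, we have $A_{n+1} \subset A_n$ $\forall n \ge 0$, hence

$$A_{n+2} \sim A_n \qquad \forall n$$

and

$$(A_n \setminus A_{n+1}) \sim \varphi(A_n \setminus A_{n+1}) = A_{n+2} \setminus A_{n+3}.$$

Since the sets $A_{2j} \setminus A_{2j+1}$, $j = 0, 1, \dots$, are pairwise disjoint, also the subsets

$$H := \bigcup_{j=0}^{\infty} (A_{2j} \setminus A_{2j+1}) \quad \text{and} \quad \varphi(H) = \bigcup_{j=1}^{\infty} (A_{2j} \setminus A_{2j+1})$$

have the same power. On the other hand trivially

$$A = H \cup \bigcup_{j=0}^{\infty} (A_{2j+1} \setminus A_{2j+2}) \cup \bigcap_{i=0}^{\infty} A_i =: H \cup L,$$

$$A_1 = \varphi(H) \cup \bigcup_{j=0}^{\infty} (A_{2j+1} \setminus A_{2j+2}) \cup \bigcap_{i=0}^{\infty} A_i =: \varphi(H) \cup L,$$

hence $A \sim A_1$, consequently $A \sim B$. □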
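The proof is constructive once membership in $H$ can be decided: the bijection $A \to B$ equals $h$ on $H$ and $k^{-1}$ on $L$. The sketch below works this out for an assumed toy example, $A = B = \mathbb{N}$ with $h(a) = a + 1$ and $k(b) = b + 1$, so that $\varphi(a) = a + 2$; all function names are hypothetical. In this example pulling an element back along $\varphi$ always stops after finitely many steps; in general, elements of $\bigcap_i A_i$ can be pulled back forever and belong to $L$.

```python
def h(a):
    """Assumed injection A -> B."""
    return a + 1


def k(b):
    """Assumed injection B -> A."""
    return b + 1


def k_inverse(a):
    """Inverse of k on A_1 = k(B); None if a is not in A_1."""
    return a - 1 if a >= 1 else None


def phi_inverse(a):
    """Inverse of phi = k o h on A_2 = phi(A); None if a is not in A_2."""
    return a - 2 if a >= 2 else None


def in_H(a):
    """a lies in some A_{2j} minus A_{2j+1} iff its last pull-back is outside A_1."""
    while phi_inverse(a) is not None:
        a = phi_inverse(a)
    return k_inverse(a) is None


def bijection(a):
    """h on H and k^{-1} on L, as in the proof."""
    return h(a) if in_H(a) else k_inverse(a)


print([bijection(a) for a in range(6)])
```

Here $H$ consists of the even numbers and $L$ of the odd ones, so the resulting bijection swaps $2j$ and $2j + 1$.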

An immediate consequence of the Cantor-Bernstein theorem and of 
Proposition 3.56 is 

3.59 Proposition. $\aleph_0$ is the first transfinite cardinal, i.e., $\alpha < \aleph_0$ if and
only if $\alpha$ is finite.

We notice that Proposition 3.59 does not follow directly from
Proposition 3.57 since a priori $\alpha < \aleph_0$ is not alternative to $\aleph_0 \le \alpha$.



c. Denumerable sets 



Since the subset of primes in $\mathbb{N}$ is infinite, it is denumerable. Similarly the
set

$$A := \{p^n \mid p \text{ prime},\ n \in \mathbb{N}\}$$

is denumerable. Since $A$ is in one-to-one correspondence with $\mathbb{N} \times \mathbb{N}$, we
infer

$$\aleph_0 \cdot \aleph_0 = \aleph_0.$$

More generally, the set

$$\{p_1 p_2 \cdots p_n \mid p_1, p_2, \dots, p_n \text{ prime}\}$$

is denumerable, hence $\aleph_0^n = \aleph_0$. The previous relation can be inferred
by means of the following procedure known as the first Cantor diagonal
method. Let $I_n = \{a^n_1, a^n_2, a^n_3, \dots\}$, $n = 1, 2, \dots$, be a countable family of
denumerable sets. Enumerating $\bigcup_n I_n$ as follows

$$a^1_1,\ a^2_1,\ a^1_2,\ a^3_1,\ a^2_2,\ a^1_3,\ a^4_1,\ a^3_2,\ a^2_3,\ a^1_4,\ \dots$$

compare with Figure 3.13, we see that

$$\text{card } \bigcup_n I_n = \aleph_0.$$
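The diagonal enumeration can be written as a generator that visits the index pairs along the antidiagonals. In this sketch a pair $(n, j)$ stands for the element $a^n_j$, which is an assumption about how the indices are attached to the sets; the function name is illustrative.

```python
from itertools import islice


def diagonal_indices():
    """Yield (n, j) in the order a^1_1, a^2_1, a^1_2, a^3_1, a^2_2, a^1_3, ..."""
    s = 2  # s = n + j is constant along each antidiagonal
    while True:
        for j in range(1, s):
            yield (s - j, j)
        s += 1


print(list(islice(diagonal_indices(), 10)))
```

Every pair $(n, j)$ is reached after finitely many steps, namely on the antidiagonal $n + j$, which is what makes the enumeration exhaustive.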



3.60 ¶ Show that

(i) $\mathbb{N}^n$, $n \ge 1$, is denumerable;

(ii) $\mathbb{Q}$ is denumerable;

(iii) the set of polynomials with integer coefficients is denumerable;

(iv) a real number is said to be an algebraic number if it is a root of a polynomial
with integer coefficients. Show that the set of algebraic numbers is countable.

Figure 3.13. The first Cantor diagonal method.



d. The axiom of choice

Let $A$ be a set; a partial order on $A$, denoted by $\le$, is a relation on $A$ with
the properties

(i) Reflexive. $x \le x$ $\forall x \in A$,

(ii) Antisymmetric. If $x, y \in A$, $x \le y$ and $y \le x$, then $x = y$,

(iii) Transitive. If $x, y, z \in A$, $x \le y$ and $y \le z$, then $x \le z$.

Notice that we do not require that necessarily either $x \le y$ or $y \le x$. If
this last property occurs, we say that $\le$ is a (total) order on $A$. On a tree,
there is a natural partial order, in $\mathbb{R}$ there is an order. The set of parts
$\mathcal{P}(X)$ of a set $X$ is partially ordered by the inclusion: if $A, B \subset X$, then
$A, B \in \mathcal{P}(X)$, and we say that $A \le B$ in $\mathcal{P}(X)$ if and only if $A \subset B$.

Because of the Cantor-Bernstein theorem, the relation $\le$ defines a
partial order on the family of cardinals. Does it define a total order? In
other words, given two cardinals, is it true that either $\alpha \le \beta$ or $\beta \le \alpha$? The
question is equivalent to the following: given a set $A$ with card $A = \alpha$, and
a cardinal $\beta$ such that $\beta \ge \alpha$ does not hold, can we construct a subset
$B \subset A$ with card $B = \beta$? Of course the construction of $B$ involves the
choice of elements of $A$, and, in the case $\beta = \aleph_0$, we actually constructed
a countable set $B$ by induction, see Proposition 3.57.

In 1904, Ernst Zermelo (1871-1951) showed, but we are not going to 
present the proofs here, that the answer is positive, i.e., the following 
theorem holds. 

3.61 Theorem. We have card $A \le$ card $B$ or card $B \le$ card $A$ for every
pair of sets $A$ and $B$ if and only if we admit the following axiom of choice.



3.62 Axiom (Zermelo's axiom). Let $\mathcal{A}$ be a family of nonempty and
pairwise disjoint sets. Then there exists a set $C$ such that $C \cap A$ consists
of exactly one element for each $A \in \mathcal{A}$.

Nowadays Zermelo's axiom is widely accepted as one of the standard
mathematical tools, though over the years many attempts have been made
to avoid its use in many mathematical theories.

There are many equivalent ways of expressing Zermelo's axiom and
often the equivalence is not at all detectable at first sight.

3.63 Theorem. The following claims are equivalent

(i) Zermelo's Axiom 3.62.

(ii) Axiom of choice. Let $\{X_i\}_{i \in I}$ be a family of nonempty sets with
indices in a set $I$. Then there exists a choice function $\varphi$ defined on
$I$ such that $\varphi(i) \in X_i$ $\forall i \in I$, that is, we can choose an
element $x_i = \varphi(i)$ in each $X_i$ and consider the set $\{x_i\}_{i \in I}$.

(iii) Cartesian product. The Cartesian product of the family $\{X_i\}_{i \in I}$,
$\prod_{i \in I} X_i$, is empty if and only if one of the factors $X_i$ is empty.

Suppose a partial order $\le$ is defined on $A$, and let $C \subset A$. In this situation
we can easily introduce the notions of upper bound, supremum and maximum
of $C$. An upper bound for $C$ is an element $m \in A$ such that $c \le m$ $\forall c \in C$;
the supremum of $C$ is, if it exists, the least of the upper bounds $m$ of $C$;
$c_0 \in A$ is the maximum of $C$ if $c_0 \in C$ and $c \le c_0$ $\forall c \in C$. Moreover, we
say that $c_0 \in C$ is a maximal element for $C$ if there is no $b \in C$ such that $c_0 \le b$
and $b \ne c_0$; finally, a totally ordered subset of $C$ is called a chain of $C$.

With the previous definitions we can now formulate two other
equivalent forms of Zermelo's axiom.

3.64 Theorem (well-ordering). On every set $X$ there is an order such
that every nonempty subset has a minimum.

3.65 Theorem (Zorn's lemma). Let $A$ be a set partially ordered by $\le$,
and suppose that every chain of it has a supremum. Then for every $a \in A$
there exists a maximal element $x \in A$ such that $a \le x$.



e. The power of the continuum 

3.66 Theorem (Cantor). Let $A$ be a countable set. The family $\mathcal{P}(A)$ of
subsets of $A$ has the same power as the family $2^A$ of mappings $\varphi : A \to
\{0, 1\}$, i.e., $2^{\aleph_0}$, and it is strictly larger than $\aleph_0$.

Proof. The map that associates to each subset $E \subset A$ its characteristic
function $\chi_E$ (defined by $\chi_E(x) = 1$ if $x \in E$ and $\chi_E(x) = 0$ if $x \notin E$)
clearly defines a bijection between $\mathcal{P}(A)$ and $2^A$. It remains to show that
$2^{\aleph_0} > \aleph_0$, that is, one cannot enumerate the family of sequences with values
0 and 1. We shall prove this using the second Cantor diagonal method.
Suppose we are able to enumerate all sequences of 0 and 1. In this case we
can form the table

$$\begin{array}{cccc}
a^1_1 & a^1_2 & a^1_3 & \dots \\
a^2_1 & a^2_2 & a^2_3 & \dots \\
a^3_1 & a^3_2 & a^3_3 & \dots \\
\vdots & \vdots & \vdots &
\end{array}$$

where the $a^i_j$ are either 0 or 1. We now define a new sequence with values
in $\{0, 1\}$ which is not listed in the previous table, a contradiction. For that,
define

$$x_k := \begin{cases} 1 & \text{if } a^k_k = 0, \\ 0 & \text{if } a^k_k = 1. \end{cases}$$

Clearly $\{x_k\}$ does not agree with any of the lines in the table. □
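The diagonal construction can be tried on a finite table, where it produces a row that differs from the $k$-th row in the $k$-th entry. This is only an illustration on an assumed sample table with a hypothetical function name; the theorem concerns infinite sequences.

```python
def diagonal_sequence(table):
    """Return x with x_k = 1 - a^k_k, for a square table of 0/1 entries."""
    return [1 - table[k][k] for k in range(len(table))]


table = [
    [0, 1, 0],
    [1, 1, 0],
    [0, 0, 0],
]
x = diagonal_sequence(table)
print(x)
print(all(x[k] != table[k][k] for k in range(len(table))))
```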

If we now observe that whenever card $A > \aleph_0$ and $B \subset A$ with card $B = \aleph_0$,
then card $A \setminus B$ = card $A$, and if we represent the reals in a binary basis, we
easily conclude that card $[0, 1] = 2^{\aleph_0}$. Also since $\tan(\pi(x - 1/2))$, $x \in\ ]0, 1[$,
is a bijection between $]0, 1[$ and $\mathbb{R}$, we infer that

$$\text{card } \mathbb{R} = 2^{\aleph_0},$$

i.e., $2^{\aleph_0}$ is the power of the continuum. Finally by the first Cantor diagonal
method we have card $\mathbb{R}^n = 2^{\aleph_0}$, too. We can then summarize

3.67 Theorem. The sets $[0, 1]$, $[0, 1]^n$, $n \ge 2$, and $\mathbb{R}^n$ all have the power
of the continuum.




3.68 The real numbers that are not algebraic (see, for example, Exercise 3.60) are 
called transcendental. Show that the set of transcendental numbers has the power of 
the continuum. 

The claim that the segment $[0, 1]$ and the $n$-dimensional cube have
the same power deserves a few comments. The claim in Theorem 3.67
means that there exists a one-to-one map between $[0, 1]$ and $[0, 1]^n$. As
paradoxical as it may appear, it says in particular that the concept of
power or cardinality and that of dimension, that is, in its intuitive form, the
number of independent variables needed to describe a particular situation,
are unrelated: for example it is not enough to describe an object in a
one-to-one way with two parameters in order for it to be a surface. Actually
the notion of dimension is related to more refined structures than just
counting points, as, for instance, continuity.



f. The continuum hypothesis 

More generally one shows that $2^\alpha > \alpha$ for any cardinal $\alpha$. This way we
can construct a hierarchy of transfinite cardinals

$$\text{card } \mathbb{N} < \text{card } \mathcal{P}(\mathbb{N}) < \text{card } \mathcal{P}(\mathcal{P}(\mathbb{N})) < \cdots. \tag{3.16}$$

The question naturally arises of whether such a hierarchy exhausts all
transfinite cardinals.

The hypothesis that the cardinality of the continuum is the smallest
nondenumerable cardinal, i.e., that no other cardinal lies between $\aleph_0$ and
$2^{\aleph_0}$, is called the continuum hypothesis, while one refers to the
assumption that the hierarchy in (3.16) exhausts all transfinite cardinals as the
generalized continuum hypothesis.

In 1939 Kurt Gödel (1906-1978) showed that the generalized continuum
hypothesis (in particular the continuum hypothesis) is consistent with
the standard axioms of set theory; in 1963 Paul Cohen (1934-2007) showed
that also its negation (or the negation of the continuum hypothesis) is
consistent with the same axioms. In other words, the continuum hypothesis is
independent from the axioms of set theory and we can develop a set theory
in which it is valid and a set theory in which it is not valid.

Figure 3.14. Bertrand Russell (1872-1970) and David Hilbert (1862-1943).



3.3.2 Some information on the theory of sets 

In the second half of the nineteenth century mathematicians realized that
a reasonable theory of sets was necessary for the development of
mathematics. It is commonly agreed that the creator of the theory of sets was
Georg Cantor (1845-1918) who developed the theory in many papers and
made use of it in several contexts, and especially in the study of cardinal
and ordinal numbers.

At the same time as Cantor, Gottlob Frege (1848-1925) developed a 
formal theory of the higher order calculus of predicates. This theory may 
be regarded as a theory of sets based on two axioms: 

o Axiom of extensionality. Two sets are equal if they contain the 
same members. 

o Axiom of abstraction. Given a predicate p{x), there exists the set 
of X that satisfy p{x). 

Frege's axioms in connection with sets that are too large lead to several
paradoxes. Cantor himself observed that the set of all sets should have a
maximum cardinality $K$, contradicting the fact that $2^K > K$. A similar
observation had already been made by Cesare Burali-Forti (1861-1931) in
connection with the theory of ordinals. In 1902 Bertrand Russell (1872-
1970) observed that the axiom of abstraction is contradictory; in fact, if
$R$ is the set of all sets that are not members of themselves, then $R$ is a
member of itself if and only if it is not a member of itself.

What is the reason for paradoxes and how can we avoid them? A first
reason was found by J. Henri Poincaré (1854-1912) and Bertrand Russell
(1872-1970), and consists in the use of so-called impredicative notions,
that is, in the use of quantifiers acting on all members of the set in order
to define a new element. A first attempt to make up for this was the
theory of types developed by Bertrand Russell (1872-1970) and Alfred
N. Whitehead (1861-1947) in Principia Mathematica. However, excluding
impredicative notions has some consequences, for instance the definition
of the supremum for subsets of $\mathbb{R}$ is impredicative. Another reason for
the occurrence of paradoxes was seen by L. E. J. Brouwer (1881-1966) and
the intuitionists in the principle of excluded middle (tertium non datur):
either $p$ or not $p$. They say that such a principle holds for
finite sets, but not in situations in which we use quantifiers on infinite
sets. For the intuitionists the fact that "$p(x)$ holds $\forall x$" does not hold does
not imply that "there is $x$ such that $p(x)$ does not hold." They in fact
interpret (or better, demand that one should interpret) the existence of $x$
as a procedure for exhibiting such an $x$. On this basis the intuitionists
started a program of reformulation of mathematics, that later on turned
out to be of extreme relevance for information science, but doing that they
also came to unsatisfactory conclusions such as, for instance, that every
real function that exists in their sense is continuous.
Nobody, or hardly anybody, is willing to give up Cantor's results; as
Hilbert put it, "no one will expel us from the paradise which Cantor created
for us," and Bertrand Russell describes Cantor's work as "probably the
greatest of which the age can boast." However, in order to compare two
cardinals $\alpha$ and $\beta$ (i.e., to say whether $\beta = \alpha$, $\alpha < \beta$ or $\beta < \alpha$) Cantor had
to assume that every set can be well-ordered, a counter-intuitive claim.

In 1904 Ernst Zermelo (1871-1951) showed that every set can be
well-ordered. In 1908 he analyzed the assumptions from which the theorem
follows and which do not allow inference of the old paradoxes, though this
did not answer the question of whether the new axioms would give rise
to new paradoxes.

Zermelo gives up Frege's axiom of abstraction (which is contradictory)
and replaces it with rules that produce admissible sets (by means of unions,
intersections and powers) and with a weaker form of the axiom of
abstraction, the

o Axiom of segregation. Given a set $X$ and a predicate $p(x)$, there
exists the subset $\{x \in X \mid p(x)\}$.

In the previous axiom $p$ may be impredicative, i.e., may contain quantifiers
on all $x$.

With these axioms, Zermelo was able to produce finite sets such as

$$\{\emptyset, \{\emptyset\}, \{\{\emptyset\}\}, \{\emptyset, \{\emptyset\}\}\}$$

but could not produce infinite sets. For this reason Zermelo also assumes

o Axiom of infinity. There exists a set that contains the empty set and
contains $\{x\}$ if it contains $x$.

Figure 3.15. The frontispieces of a collection of sources of Mathematical logic and of a
popular book about infinity.

Notice that the axiom of infinity states, in terms of sets, the existence
of the natural numbers as an actual infinite set and not just as a potential
infinite set. Finally, Zermelo states

o Axiom of choice. Given a family of disjoint nonempty sets $\{X_\alpha\}$,
there exists a set $C$ which has as its members one and only one element
from each $X_\alpha$,

which is crucial for the proof of the well-ordering theorem.

The previous axioms, slightly modified by Abraham A. Fraenkel (1891-
1965), are the Zermelo-Fraenkel axioms of the theory of sets that allow one
to prove the well-ordering theorem. Zermelo's idea is: if we accept those
claims, then we also have to accept the well-ordering theorem.

In contrast with the intuitionists, David Hilbert (1862-1943) started 
a new program. For Hilbert “forbidding a mathematician to make use of 
the principle of excluded middle is like forbidding an astronomer his tele- 
scope.” For Hilbert we need to distinguish between the formalism, which is 
finitistic, and interpretations of the formalism which may be nonfinitistic: 
for example, the calculus of polynomials is finitistic, but the interpretation 
of the formalism of polynomials as polynomial functions is nonfinitistic. 
Hilbert's idea is then that formalism, being finitistic, always works;
however, a part of it, the real sentences, has a meaning which is accepted by
everybody, while ideal sentences may have a meaning which is not
unanimously accepted but, in any case, ideal sentences can be used to infer real
sentences. From this point of view the fundamental question is that of the
consistency of the system and the central question becomes the question
of the consistency of the Zermelo-Fraenkel axioms.

Without getting into many details it is worth saying something more
about the meaning of ideal and real sentences. Roughly speaking, a
real sentence stands for a generalization of particular observations, thus
"$p(x, y)$ holds $\forall x, y \in X$" is a real sentence, while "$\forall x \in X\ \exists y \in X$
such that $p(x, y)$ holds" is an ideal sentence. Corresponding to real and
ideal sentences we have finitary and nonfinitary proofs. A finitary proof
is essentially a proof that uses only arguments on finite sets or inductive
arguments, while a nonfinitary proof is for instance one which makes an
essential use of the axiom of infinity.

In principle the proof of consistency should be finitary: for all d, d is not 
a proof of a contradiction. Therefore it should be possible not only to prove 
theorems of relative consistency (reducing the proof of the construction 
of the reals to that of rationals and, in turn, to that of naturals and of 
the theory of sets), but, in the context designed by Hilbert, it should be 
possible to prove a theorem of absolute consistency. Hilbert’s idea was that 
in order to prove consistency it suffices to find a property, that is satisfied 
by the axioms, preserved by the inference rules, but that is not satisfied 
by a contradictory sentence. This way the proof of consistency could be 
carried out by induction. 

Hilbert (as many other mathematicians) was worried by the fact that 
paradoxes, confined for the time to areas away from the kernel of mathe- 
matics, would enter the field of mathematical analysis, just refounded on 
nonfinitary arguments of set theory. On the other hand he firmly believed 
that every proof, even nonfinitary proofs, could be formally analyzed as 
a finite sequence of symbols or formulas, worked out according to precise
syntactical rules (instead of as a flow of ideas, meanings and concepts),
and consequently could be handled in a finitary way. 

Hilbert’s program reached a crisis when in 1931 Kurt Gödel (1906-1978)
proved his celebrated incompleteness theorems, a consequence of
which being that one can exhibit real sentences (in the sense of Hilbert) 
that can be proved only with nonfinitary means using the same axioms 
except the one asserting the existence of an infinite set. Later it was proved 
that one can exhibit a polynomial (in more than one variable) with integer 
coefficients without integer roots, a finitary claim, that however requires 
in an essential way the use of the axiom of infinity. 

But there is more. Modulo coding numbers, Gödel proves that the claim
of consistency of the Zermelo-Fraenkel axioms is not provable not only
with finitary but even with nonfinitary means. However, the sum of knowledge
acquired leads and transforms into the study of formal systems, that is
into a new branch of mathematics, though, as Hermann Weyl (1885-1955) 
states, 

the question of the ultimate foundations and the ultimate meaning
of mathematics remains open; we do not know in what direction
it will find its final solution or even whether a final objective
answer can be expressed at all. “Mathematizing” may well
be a creative activity of man, like language or music, of primary
originality,^ whose historical decisions defy complete objective
rationalization.



3.4 Summing Up 

Integer arithmetic 

Integral numbers 

The integral division of integers leads naturally to the notions of prime and coprime 
numbers and of greatest common divisor of two numbers as well as to Euclid’s algorithm 
for finding the greatest common divisor of two numbers. An extension, Euclid’s generalized
algorithm, produces a solution $(x, y) \in \mathbb{Z}^2$ of the equation $ax + by = \text{g.c.d.}(a, b)$,
which allows computation of all solutions of the linear equation with integral coefficients

$$ax + by = c,$$

see Bézout’s theorem, Theorem 3.9.
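
The generalized Euclid algorithm mentioned above can be sketched as follows. This is only a minimal illustration, not the book's own presentation: the function names `extended_gcd` and `solve_linear` are hypothetical, and nonnegative inputs that are not both zero are assumed.

```python
def extended_gcd(a: int, b: int) -> tuple[int, int, int]:
    """Return (g, x, y) with g = g.c.d.(a, b) and a*x + b*y = g.

    Assumes a, b >= 0 and not both zero (hypothetical helper).
    """
    old_r, r = a, b
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r != 0:
        q = old_r // r
        # Each remainder stays an integer combination of a and b.
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    return old_r, old_x, old_y


def solve_linear(a: int, b: int, c: int):
    """Return one integer solution (x, y) of a*x + b*y = c, or None."""
    g, x, y = extended_gcd(a, b)
    if c % g != 0:
        return None  # solvable only if g.c.d.(a, b) divides c
    k = c // g
    return x * k, y * k
```

Once one solution $(x_0, y_0)$ is known, all the others are obtained by adding integer multiples of $(b/g, -a/g)$, with $g = \text{g.c.d.}(a, b)$.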

o Fundamental theorem of Arithmetic. Every integer $n \ge 2$ decomposes as a
product of primes and, apart from rearrangement of factors, that decomposition is 
unique. 



Congruences 

Bézout’s theorem solves linear first order congruences modulo p:

o $ax \equiv c \pmod p$ is solvable if and only if c is a multiple of g.c.d.(a, p). In this case
one is able to find all the solutions, see Proposition 3.16.
o $ax \equiv 1 \pmod p$ is always solvable with a unique solution $x \in \{0, \dots, p - 1\}$ if p is
prime and a is not a multiple of p. Thus the ring of the remainders modulo p, $\mathbb{Z}_p$, is a field if p is prime.
o Chinese remainder theorem. Given pairwise coprime numbers $p_1, p_2, \dots, p_n$, the system

$$x \equiv b_1 \pmod{p_1},$$
$$x \equiv b_2 \pmod{p_2},$$
$$\dots$$
$$x \equiv b_n \pmod{p_n}$$

is solvable for any $b_1, b_2, \dots, b_n$, and two solutions differ by a multiple of $p_1 p_2 \cdots p_n$.
The Chinese remainder theorem is often used to solve $ax \equiv b \pmod n$ when n is a
product of distinct primes.
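
As an illustration of the Chinese remainder theorem, here is a short sketch that builds a solution of the system explicitly. The function name `crt` is hypothetical; the moduli are assumed pairwise coprime, and `pow(m, -1, p)` (Python 3.8 or later) computes the inverse of m modulo p.

```python
from math import prod


def crt(residues, moduli):
    """Solve x = residues[i] (mod moduli[i]) for pairwise coprime moduli.

    Returns the unique solution in {0, ..., p_1 * ... * p_n - 1}.
    """
    n = prod(moduli)
    x = 0
    for b, p in zip(residues, moduli):
        m = n // p  # product of all the other moduli
        # m * pow(m, -1, p) is 1 modulo p and 0 modulo every other modulus.
        x += b * m * pow(m, -1, p)
    return x % n
```

For instance, `crt([2, 3, 2], [3, 5, 7])` looks for a number with remainders 2, 3, 2 when divided by 3, 5, 7; any two solutions differ by a multiple of 105.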

A useful tool to analyze the multiplicative structure of the ring $\mathbb{Z}_n$ is the exponential
modular function from $\mathbb{Z}_n$ into $\mathbb{Z}_n$ given by $x \mapsto a^x \pmod n$. We have
o Fermat’s minor theorem. If p is prime, then $a^{p-1} \equiv 1 \pmod p$ $\forall a \in \mathbb{Z}_p$, $a \neq 0$.
o Euler’s theorem. Denote by $\phi(n)$ the number of positive integers < n that are coprime with
n. Then $a^{\phi(n)} \equiv 1 \pmod n$ for all a coprime with n.

o Let p and q be distinct primes. Set n := pq and let e and d be such that $ed \equiv 1 \pmod{\phi(n)}$.
Then the two modular power maps from $\mathbb{Z}_n$ into $\mathbb{Z}_n$ given by

$$a \mapsto a^e \pmod n \quad \text{and} \quad a \mapsto a^d \pmod n$$

are one the inverse of the other.

The latter sentence is the foundation of the RSA public key cryptography.
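
The inverse pair of modular power maps can be checked on a toy example. The sketch below uses deliberately tiny, illustrative primes (p = 61, q = 53, e = 17 are arbitrary choices, not values from the text) and the hypothetical names `encrypt` and `decrypt`; real RSA uses much larger primes and additional padding.

```python
# Toy parameters, chosen only for illustration.
p, q = 61, 53
n = p * q                  # modulus, n = pq
phi = (p - 1) * (q - 1)    # Euler's function of n for distinct primes p, q
e = 17                     # public exponent, coprime with phi
d = pow(e, -1, phi)        # private exponent, e*d = 1 (mod phi)


def encrypt(a: int) -> int:
    """The map a -> a^e (mod n)."""
    return pow(a, e, n)


def decrypt(c: int) -> int:
    """The map c -> c^d (mod n), inverse of encrypt on Z_n."""
    return pow(c, d, n)
```

Since n is a product of distinct primes, `decrypt(encrypt(a)) == a` holds for every a in $\mathbb{Z}_n$, not only for the a coprime with n.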

^ And usefulness, we add. 
Figure 3.16. Samplings in different models of counting.



Combinatorics 

The table in Figure 3.16 summarizes the numbers of ordered and nonordered samples in
the various models of counting: arrangements, sets and maps, drawings and locations. 



Cardinals 

Cardinality is a way to count elements in a set. Two sets have the same cardinality if 
there is a bijection between them, and cardinals are simply the equivalence classes of 
sets which are in a one-to-one correspondence. 

One distinguishes sets with finite cardinality, or simply finite, that is the sets which
are in a one-to-one correspondence with bounded sets of $\mathbb{N}$. The cardinality of such sets
is just the number of elements they have. The other sets are called infinite, and their
cardinals are said to be transfinite. Among these sets, the sets which are in a one-to-one
correspondence with $\mathbb{N}$ are called denumerable or countable, and their cardinality
is denoted by $\aleph_0$. These sets are obviously infinite.
The inclusion relation between sets defines a relation on cardinals: $\alpha \le \beta$ if and
only if there exist sets A and B such that card A = $\alpha$, card B = $\beta$, and $A \subset B$. We also
set $\alpha < \beta$ iff $\alpha \le \beta$ and $\alpha \neq \beta$. The relation $\le$ between cardinals is obviously reflexive
and transitive, while the antisymmetry is given by the

o Cantor-Bernstein theorem. Let $\alpha$ and $\beta$ be two cardinals. If $\alpha \le \beta$ and $\beta \le \alpha$,
then $\alpha = \beta$.

It then follows 

o The relation $\le$ between cardinals is a partial order.

o Let $A \subset \mathbb{N}$. Then, either A is finite or it is in a one-to-one correspondence with $\mathbb{N}$.

Equivalently, $\aleph_0$ is the first transfinite cardinal.
o $\mathbb{Z}$, $\mathbb{Q}$, and for n > 1, $\mathbb{Z}^n$, are denumerable.

At the beginning of the twentieth century, Zermelo showed that the partial order
relation $\le$ between cardinals actually is a total order, that is, given sets A and B we
have either card A $\le$ card B or card B $\le$ card A, provided we assume the following:

o Zermelo’s axiom of choice. Let $\mathcal{A}$ be a family of nonempty and pairwise disjoint
sets. Then there exists a set C such that $C \cap A$ consists exactly of one element for
each $A \in \mathcal{A}$.

Nowadays the axiom of choice is tacitly accepted, hence the possibility to compare 
different cardinals. 

o Cantor. Let $\alpha := \text{card}(A)$. Denote by $2^\alpha$ the cardinality of the set of all maps
$\varphi : A \to \{0, 1\}$. Then $2^\alpha > \alpha$. In particular $2^{\aleph_0} > \aleph_0$.
o $[0, 1] \subset \mathbb{R}$, $\mathbb{R}$ and more generally, for every $n \ge 2$, $\mathbb{R}^n$ have cardinality $2^{\aleph_0}$. It is
therefore impossible to distinguish sizes and “dimensions” by counting points.



3.5 Exercises 

3.69 ¶. Find a number that is divisible by 7 and that, divided by 2, 3, 4, 5 or 6, always
yields a remainder 1.

3.70 The least common multiple of two positive integers a and b is the least positive
number that is divisible by both a and b. It is denoted by l.c.m.(a, b). Show that

$$\text{l.c.m.}(a, b) \cdot \text{g.c.d.}(a, b) = ab.$$

3.71 ¶. If p and q divide a and g.c.d.(p, q) = 1, then pq divides a.

3.72 ¶. Find the g.c.d.(a, b) and the l.c.m.(a, b) for each of the following pairs of integers:

(15000, 32768), (46035, 47430), (17795, 43291), (2295, 1989).

3.73 Solve $ax + by = \text{g.c.d.}(a, b)$, $x, y \in \mathbb{Z}$, for the pairs (a, b) that follow:

(1542, 2102), (2287, 442), (1485, 1547), (38, 127).

3.74 ¶. Show that p is prime if and only if g.c.d.(a, p) = 1 for all a, $2 \le a < p$.
3.75 ¶. Let $d \in \mathbb{N}$, $d \ge 2$. Show that each $n \in \mathbb{N}$ can be uniquely represented as

$$n = a_0 + a_1 d + a_2 d^2 + \dots + a_k d^k = \sum_{j=0}^{k} a_j d^j$$

with $0 \le a_i \le d - 1$ $\forall i$, known as the representation of n in base d.
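
The existence part of the representation in base d can be explored numerically by repeated integral division. The sketch below is only an illustration (the name `digits` is hypothetical) and does not replace the proof asked for in the exercise.

```python
def digits(n: int, d: int) -> list[int]:
    """Coefficients [a_0, a_1, ..., a_k] of n in base d, least significant first."""
    if d < 2 or n < 0:
        raise ValueError("need d >= 2 and n >= 0")
    coeffs = []
    while True:
        n, r = divmod(n, d)  # r is the next coefficient, 0 <= r <= d - 1
        coeffs.append(r)
        if n == 0:
            break
    return coeffs
```

Summing `a * d**j` over the returned coefficients gives back n, which is exactly the displayed identity.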



3.76 Let a and b be coprime. Show that

$$\frac{1}{ab} = \frac{x}{a} + \frac{y}{b}$$

with $x, y \in \mathbb{Z}$.

3.77 Show that every rational number $r = p/q$, $p, q \in \mathbb{Z}$, $q \neq 0$, has a unique
representation of the form

$$r = \frac{a_1}{p_1^{\alpha_1}} + \frac{a_2}{p_2^{\alpha_2}} + \dots + \frac{a_k}{p_k^{\alpha_k}}$$

where $a_1, a_2, \dots, a_k$ are integer coefficients, and $p_1, p_2, \dots, p_k$ are distinct primes.

3.78 If p is prime, show that $(a + b)^p \equiv a^p + b^p \pmod p$.

3.79 ¶. Show that

(i) n is divisible by 3 if and only if the sum of its digits (in base 10) is divisible by 3,

(ii) n is divisible by 9 if and only if the sum of its digits (in base 10) is divisible by 9,

(iii) $n = \sum_{j=0}^{k} a_j 10^j$ is divisible by 11 if and only if the alternate sum of its digits
$a_0 - a_1 + a_2 - a_3 + \dots + (-1)^k a_k$ is divisible by 11.

3.80 ¶¶. Show that for every N > 1 there exist N consecutive numbers none of which
is prime. [Hint: If p is prime and p > N, consider the numbers p! + 2, p! + 3, . . . , p! + p.]

3.81 ¶¶. Deduce from the prime number theorem that, if $p_n$ is the n-th prime number,

$$\lim_{n \to \infty} \frac{p_n}{n \log n} = 1.$$

3.82 ¶. Let $\{a_k\}$ be a sequence of real numbers in binary representation. Cantor’s
diagonal procedure then produces a real number $a \notin \{a_k\}$. In particular, every sequence
of algebraic numbers produces a nonalgebraic number.



3.83 ¶. Find the probability that two persons chosen at random were born on a Monday.

3.84 ¶. In how many different ways

(i) can 8 persons be seated in 5 seats?

(ii) can 5 persons be seated in 8 seats?

3.85 ¶ Poker. Find the probability for a poker hand to be three of a kind or a full
house.

3.86 ¶ Méré paradox. Show that it is more probable to get at least one ace with
four dice than at least one double ace in 24 throws of two dice.
3.87 ¶. Drawing successively with replacement c balls from n labeled from 1 to n,
what is the probability of drawing k different balls?

3.88 ¶. From a box containing n distinct balls, what is the probability that a sample
of size k, obtained with replacement, contains two equal balls?

3.89 ¶ Birthdays. What is the probability that in a population of n people the birthday
of at least two people will fall on the same day, assuming equal probability for each
day? Compute the probability for n = 10, 25, 50. Suppose n = 12; what is the probability
that the birthdays of the twelve people will fall in twelve different months?



3.90 What is the probability that a random number between 1 and n divides n? 



3.91 Though Robin Hood is a wonderful archer (he hits the mark 9 times out of 
10), he faces a difficult challenge in this tournament. In order to win, he must hit the 
center of the mark at least 4 times with the next 5 arrows. On the other hand, if he hit 
the mark 5 times out of 5, the county sheriff would recognize him. Let us suppose he 
can miss the mark at will: what is the probability of his winning the tournament? 



3.92 ¶. A drug smuggler mixes drug pills with vitamin pills, hoping customs officers
won’t find him out. Of a total of 400 pills, only 5% are illegal ones. If the officers check 
5 pills, what is their probability of finding an illegal one? 



3.93 A box contains 90 balls numbered from 1 to 90. We sample without replacement 
5 balls. What is the probability that they contain the balls 1, 2 and 3? Suppose we add 
three more balls numbered 1,2 and 3 to the original 90 balls. What is now the probability 
that after producing a sample of size 5 the trick is discovered? 



3.94 Let n>k>r>0 be natural numbers and let X be a set of cardinality n,
|X| = n. Define 

:= {(A, B)\BCACX, \B\=r, |A| = k}. 

Show that 



3.95 ¶. Show that the number of strings of characters of k letters from an alphabet of
n letters is $n^k$.

Show that the number of strings of characters with n letters from an alphabet $A = \{A_1, A_2, \dots, A_r\}$,
where the letter $A_i$ appears $k_i$ times with $k_i \ge 0$ and $\sum_{i=1}^{r} k_i = n$, is

$$\frac{n!}{k_1!\, k_2! \cdots k_r!}.$$



3.96 ¶. Show that

3.97 ¶. Let |X| = n and let $\mathcal{P}_k(X) := \{A \subset X \mid |A| = k\} \subset \mathcal{P}(X)$ be the set of
k-subsets of X and $\Pi_k(X)$ the set of k-partitions of X, i.e., of subsets $\{A_1, \dots, A_k\}$ of
X which are disjoint and such that $X = \cup_{i=1}^{k} A_i$. Show that

$$|\mathcal{J}(X, \{1, 2, \dots, k\})| = k!\,|\mathcal{P}_k(X)|, \qquad |\mathcal{S}(X, \{1, 2, \dots, k\})| = k!\,|\Pi_k(X)|.$$

Moreover show that

$$|\Pi_k(X \cup \{x_0\})| = k\,|\Pi_k(X)| + |\Pi_{k-1}(X)|.$$

[Hint: Notice that the k-partitions of $X \cup \{x_0\}$ divide into the ones for which $\{x_0\}$ is
one of the sets and the ones where $x_0$ is properly contained in one of the k subsets.]



3.98 N balls numbered from 1 to N are successively located in N cells numbered
from 1 to N starting from the first. What is the probability that a ball is located in
the cell with the same number? [Hint: Compute first the probability that k balls are
located in the corresponding cells.]



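3.99 ¶¶. Let $E_1, \dots, E_n$ be finite sets such that the intersection of k of them has
always the same power, i.e., for all $i_1 < i_2 < \dots < i_k$ we have $|E_{i_1} \cap E_{i_2} \cap \dots \cap E_{i_k}| = c(k)$.
Show that

$$\Big| \bigcup_{i=1}^{n} E_i \Big| = \sum_{i=1}^{n} (-1)^{i+1} \binom{n}{i} c(i).$$

In particular show that, if $\widetilde{P}_n$ denotes the family of permutations of n objects without
fixed points,

$$\widetilde{P}_n := \{\sigma \in P_n \mid \sigma_i \neq i\ \forall i\},$$

we have

$$|\widetilde{P}_n| = n! \sum_{k=0}^{n} \frac{(-1)^k}{k!}.$$

[Hint: Write $P_n \setminus \widetilde{P}_n = \cup_{i=1}^{n} E_i$ with $E_i = \{\sigma \in P_n \mid \sigma_i = i\}$.]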

3.100 ¶¶ Graphs. Many problems, both theoretical as well as of practical interest,
often translate into graph problems.

Definition. A (symmetric) graph with vertices V is a subset G of V × V such that if
$(u, v) \in G$, then $(v, u) \in G$, and $(v, v) \notin G$ for all $v \in V$.

Figure 3.17. Passing from a tree T to a word π.

Two graphs (V, G) and (V', G') are said to be isomorphic if there is a bijection
$\varphi : V \to V'$ such that $(u, v) \in G$ if and only if $(\varphi(u), \varphi(v)) \in G'$. Of course we can
always decide if two finite graphs are isomorphic or not; however, the needed time can
be very large in presence of many vertices: the best algorithms are just slightly more
efficient than comparing the n! bijections from V to V'.

To make the comparison more efficient, it is convenient to look at invariants. One 
such invariant is the number of connected components of a graph. 



Definition. The connected component of v in G is the set

$$[v] := \{ w \in V \mid \exists\, u_0, u_1, \dots, u_r \in V \text{ such that } u_0 = v,\ u_r = w, \text{ and } (u_{i-1}, u_i) \in G\ \forall i = 1, \dots, r \}.$$

The number of connected components of G, c(G), is clearly the same for isomorphic 
graphs. In particular, G is said to be connected if it has only one connected component: 
being connected is an invariant. 
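
The connected components of a finite graph can be computed by exploring the graph from each vertex not yet reached. The following sketch assumes the graph is given, as in the definition above, by a vertex collection and a symmetric set G of ordered pairs; the function name `connected_components` is hypothetical.

```python
def connected_components(vertices, G):
    """Return the list of connected components (as sets) of the graph (vertices, G).

    G is assumed to be a symmetric set of ordered pairs (u, v) without loops.
    """
    adjacency = {v: set() for v in vertices}
    for u, v in G:
        adjacency[u].add(v)

    seen = set()
    components = []
    for start in vertices:
        if start in seen:
            continue
        # Collect every vertex joined to `start` by a chain of edges.
        component = {start}
        stack = [start]
        while stack:
            u = stack.pop()
            for v in adjacency[u]:
                if v not in component:
                    component.add(v)
                    stack.append(v)
        seen |= component
        components.append(component)
    return components
```

The invariant c(G) is then `len(connected_components(V, G))`, and G is connected exactly when this number is 1.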

Another invariant is the chromatic polynomial introduced by George Birkhoff 
(1884-1944) in 1912. 



Definition. A coloring of a graph (V, G) with $x \in \mathbb{N}$ colors is a mapping $\chi : V \to \{1, 2, \dots, x\}$
such that $\chi(u) \neq \chi(v)$ whenever $(u, v) \in G$, that is, such that adjacent
vertices are colored differently.

The least number of distinct colors needed to color a graph is called the chromatic
number, $\gamma(G)$, of the graph. Given a graph G with n vertices and $x \ge \gamma(G)$, show that the
number of colorings of G with x colors is given by a polynomial in x of degree n = |V|,
called the chromatic polynomial of G

Figure 3.18. Passing from a word π to a tree T.



Pg(x) := ^ 
rcG 

where the sum is taken on all subgraphs $\Gamma$ of G (including $\emptyset$ and G).^ [Hint: Compute
the number of wrong colorings.]

3.101 ¶¶ Trees. A tree T is a connected graph without cycles, a cycle being a sequence
of distinct vertices $u_0, u_1, \dots, u_n$, $n \ge 2$, with $(u_{i-1}, u_i) \in G$ for $i = 1, \dots, n$ and $(u_n, u_0) \in G$.

Theorem (Cayley). The number of trees with n vertices is the number of words with
n − 2 letters from an alphabet of n letters, i.e., $n^{n-2}$.

Figures 3.17 and 3.18 show how to construct a word from a tree T and a tree from a
word.
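
One standard way to pass from a labeled tree to a word and back is the Prüfer correspondence. Whether Figures 3.17 and 3.18 use exactly this construction is an assumption, so the sketch below should be read only as one possible illustration; the names `tree_to_word` and `word_to_tree` are hypothetical, vertices are labeled 1, ..., n and $n \ge 2$.

```python
def tree_to_word(n, edges):
    """Prüfer word (n - 2 letters) of a tree on vertices 1..n given by n - 1 edges."""
    neighbors = {v: set() for v in range(1, n + 1)}
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)
    word = []
    for _ in range(n - 2):
        # Remove the leaf with the smallest label and record its neighbor.
        leaf = min(v for v in neighbors if len(neighbors[v]) == 1)
        parent = neighbors[leaf].pop()
        neighbors[parent].remove(leaf)
        del neighbors[leaf]
        word.append(parent)
    return word


def word_to_tree(n, word):
    """Rebuild the edge list of the tree on vertices 1..n from its Prüfer word."""
    degree = {v: 1 for v in range(1, n + 1)}
    for v in word:
        degree[v] += 1
    edges = []
    for v in word:
        leaf = min(u for u in degree if degree[u] == 1)
        edges.append((leaf, v))
        degree[leaf] -= 1  # the leaf is used up
        degree[v] -= 1
    u, w = [x for x in degree if degree[x] == 1]
    edges.append((u, w))  # the last two vertices are joined
    return edges
```

Since the two maps are inverse to each other, the trees on n labeled vertices are in one-to-one correspondence with the $n^{n-2}$ words.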

^ There exist efficient algorithms to compute the chromatic polynomial of a graph. 
3.102 The Pigeonhole Principle. Prove

Proposition. Let X and Y be nonempty finite sets and let $\varphi : X \to Y$. There exists
$y \in Y$ such that the fiber $\varphi^{-1}(y)$ contains at least |X|/|Y| elements.

Despite its simplicity, it is one of the most powerful methods of combinatorics. However,
it is not always easy to understand how to use it. As an example of its applications we
state, without proof, the following theorem.

Theorem (Erdős-Szekeres). Let $a, b \in \mathbb{N}$, n := ab + 1 and let $x_1, x_2, \dots, x_n$ be any
n-sample of real numbers. Then the sequence contains either an increasing sequence of
a + 1 numbers or a decreasing sequence of b + 1 numbers.
3. Integer Numbers: Congruences, Counting and Infinity



Sir Isaac Newton Gottfried von Leibniz 

(1643-1727) (1646-1716) 

Jacob Bernoulli Johann Bernoulli 

(1654-1705) (1667-1748) 

Michel Rolle Guillaume de l’Hôpital Jacopo Riccati Guido Grandi Giulio Fagnano
(1652-1719) (1661-1704) (1676-1754) (1671-1742) (1682-1766) 

Abraham de Moivre Brook Taylor James Stirling Colin MacLaurin 

(1667-1754) (1685-1731) (1692-1770) (1698-1746) 

Leonhard Euler 
(1707-1783) 

Jean d’Alembert 
(1717-1783) 

Daniel Bernoulli Alexis Clairaut Johann Lambert Étienne Bézout

(1700-1782) (1713-1765) (1728-1777) (1730-1783) 

Joseph-Louis Lagrange
(1736-1813) 

Pierre-Simon Laplace 
(1749-1827) 

Marie Jean Condorcet Adrien-Marie Legendre Lazare Carnot Sylvestre Lacroix 

(1743-1794) (1752-1833) (1753-1823) (1765-1843) 

Joseph Fourier 
(1768-1830) 

Figure 3.19. Infinitesimal analysis: a chronology from Newton and Leibniz to Fourier. 






4. Complex Numbers 



As already stated, the process of formation of numerical systems has 
been very slow. For instance, while Heron of Alexandria (1st century AD) and
Archimedes of Syracuse (287BC-212BC) essentially accepted irrational 
numbers, working with their approximations, Diophantus of Alexandria 
(200-284) thought that equations with no integer solutions were not solv- 
able; and only in the fifteenth century were negative numbers accepted 
as solutions of algebraic equations.^ In the sixteenth century complex 
numbers enter the scene, with Girolamo Cardano (1501-1576) and Rafael 
Bombelli (1526-1573), in the resolution of algebraic equations as surdes
numbers, that is numbers which are convenient to use in order to achieve 
correct real number solutions. But Rene Descartes (1596-1650) rejected 
complex roots and coined the term imaginary for these numbers. Despite 
the fact that complex numbers were fruitfully used by Jacob Bernoulli 
(1654-1705) and Leonhard Euler (1707-1783) to integrate rational func- 
tions and that several complex functions had been introduced, such as 
the complex logarithm by Leonhard Euler (1707-1783), complex numbers 
were accepted only after Carl Friedrich Gauss (1777-1855) gave a convinc- 
ing geometric interpretation of them and proved the fundamental theorem 
of algebra (following previous researches by Leonhard Euler (1707-1783), 
Jean d’Alembert (1717-1783) and Joseph-Louis Lagrange (1736-1813)). 
Finally, in 1837 William R. Hamilton (1805-1865) introduced a formal 
definition of the system of complex numbers, which is essentially the one 
in use, giving up the mysterious imaginary unit $\sqrt{-1}$. Meanwhile complex
functions reveal their importance in treating the equations of hydrodynamics
and electromagnetism, and, in the nineteenth century, develop into
the theory of functions of complex variables with Augustin-Louis Cauchy
(1789-1857), Karl Weierstrass (1815-1897) and G. F. Bernhard Riemann 
(1826-1866). 



^ For example, Antoine Arnauld (1612-1694) questioned that −1 : 1 = 1 : −1 by
asking how a smaller could be to a greater as a greater to a smaller.
Figure 4.1. Girolamo Cardano (1501-1576) and Niccolò Fontana (1500-1557), called
Tartaglia.



4.1 Complex Numbers 

The development of the notion of complex numbers goes through their 
use in algebraic and differential problems and the understanding of their 
geometric properties. An a posteriori motivation is that they allow the 
solution of algebraic equations, as for instance $x^2 + 1 = 0$, that is not
solvable in $\mathbb{R}$.



a. The system of complex numbers 

4.1 Gauss plane. The set of complex numbers, denoted $\mathbb{C}$, is the Gauss
plane, that is the Cartesian plane with the operations of sum,

$$(a, b) + (c, d) := (a + c, b + d),$$

that is, the usual rule of summing vectors in $\mathbb{R}^2$, and of product, defined
by

$$(a, b) \cdot (c, d) := (ac - bd, ad + bc).$$

If we identify the axis of abscissas with $\mathbb{R}$ in such a way that (0, 0) = 0 and
(1, 0) = 1, and we introduce the imaginary unit i to indicate the vector
(0, 1), we see, on account of the computation rules previously defined, that
$i^2 = i \cdot i = -1$, that $z = (a, b) = a(1, 0) + b(0, 1)$ is written $z = a + ib$,
and that the product of two complex numbers is written as

$$(a + ib)(c + id) = ac + iad + ibc + i^2 bd = (ac - bd) + i(ad + bc).$$

It is easily seen that the properties (A), (M) and (AM) of the real
system $\mathbb{R}$, relative to the sum and the product, continue to hold in $\mathbb{C}$, 0
and 1 being this time respectively 0 := 0 + i0, 1 := 1 + i0. The inverse 1/z
of the complex number $z = x + iy \neq 0$ is then given by

$$\frac{1}{x + iy} = \frac{x - iy}{(x + iy)(x - iy)} = \frac{x - iy}{x^2 + y^2} = \frac{x}{x^2 + y^2} - i\,\frac{y}{x^2 + y^2}.$$

Figure 4.2. The frontispiece of the Lectures on Quaternions by William R.
Hamilton (1805-1865).

Consequently we can summarize saying that $\mathbb{C}$ is a commutative field.
Moreover, since the sum and product of complex numbers reduce to the
sum and product of real numbers on the real axis, we can state that
$\mathbb{R} \simeq \{x + iy \in \mathbb{C} \mid y = 0\}$ is a subfield of $\mathbb{C}$.
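
The definition of $\mathbb{C}$ as the plane with a sum and a product can be mirrored directly in code. The following is a minimal sketch, with the hypothetical class name `GaussPoint`, that implements only the rules stated above; Python's built-in `complex` type already provides the same arithmetic.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GaussPoint:
    """A complex number seen as the pair (a, b) of the Gauss plane."""
    a: float
    b: float

    def __add__(self, other):
        # (a, b) + (c, d) := (a + c, b + d)
        return GaussPoint(self.a + other.a, self.b + other.b)

    def __mul__(self, other):
        # (a, b) . (c, d) := (ac - bd, ad + bc)
        return GaussPoint(self.a * other.a - self.b * other.b,
                          self.a * other.b + self.b * other.a)

    def inverse(self):
        # 1/z = x/(x^2 + y^2) - i y/(x^2 + y^2), defined only for z != 0
        d = self.a ** 2 + self.b ** 2
        if d == 0:
            raise ZeroDivisionError("0 has no inverse")
        return GaussPoint(self.a / d, -self.b / d)
```

With `i = GaussPoint(0, 1)` one finds `i * i == GaussPoint(-1, 0)`, that is $i^2 = -1$.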

Of course there are several ways of ordering complex numbers; for instance,
we can order them lexicographically: $(a, b) \prec (c, d)$ if a < c or
a = c and b < d. However, none of all possible orderings is compatible
with the field structure of $\mathbb{C}$ and the order of $\mathbb{R}$. Otherwise, we would have
either $i \succ 0$ or $i \prec 0$, as $i \neq 0$, being $0 \in \mathbb{R}$ and $i \notin \mathbb{R}$; thus, in both cases,
$-1 = i^2 \succ 0$: a contradiction. For this reason inequalities between complex
numbers are meaningless.

4.2 Conjugation. Let $z := a + ib \in \mathbb{C}$. The numbers a and b, denoted
also $a =: \Re z$ and $b =: \Im z$, are called the real part and the imaginary part



Figure 4.3. The sum of complex numbers.

Figure 4.4. (a) $|z|$ and the argument $\theta$ of z. (b) Relative locations of $\pm z$ and $\pm \bar z$.



of z. The conjugate of z is defined by

$$\bar z := a - ib = \Re z - i\,\Im z.$$

Of course $\Re \bar z = \Re z$ and $\Im \bar z = -\Im z$. Consequently $\bar z$ is the symmetric of z
with respect to the real axis. The symmetric point of z with respect to the
imaginary axis is $-\bar z$, and the symmetric of z with respect to the origin is
$-z$. Moreover,

$$\Re z = \frac{z + \bar z}{2}, \qquad \Im z = \frac{z - \bar z}{2i}.$$

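4.3 ¶. Show that

$$\overline{z + w} = \bar z + \bar w, \qquad \overline{z \cdot w} = \bar z \cdot \bar w, \qquad \overline{\Big(\frac{z}{w}\Big)} = \frac{\bar z}{\bar w} \ \text{ if } w \neq 0.$$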



4.4 Absolute value or modulus. Let $z = a + ib \in \mathbb{C}$. Its absolute
value, or modulus, is defined as the nonnegative real number

$$|z| := \sqrt{a^2 + b^2} = \sqrt{(\Re z)^2 + (\Im z)^2}.$$

Clearly |z| is the Euclidean length of z in the Gauss plane (i.e., the distance
between z and the origin) and agrees with the modulus in $\mathbb{R}$ if z is real.
Clearly

(i) $|z| \ge 0$, |z| = 0 if and only if z = 0,

(ii) Triangle inequality. $|z + w| \le |z| + |w|$.

4.5 ¶. Show that for all z and $w \in \mathbb{C}$ the following hold:

$$|z|^2 = z \bar z, \qquad |zw| = |z|\,|w|, \qquad |\bar z| = |z|,$$

$$|\Re z| \le |z|, \qquad |\Im z| \le |z|, \qquad \big|\,|z| - |w|\,\big| \le |z - w|.$$
4.6 Hermitian product. The Hermitian product of z and w is simply
$z \bar w$. If $w = a + ib$ and $z = c + id$, then

$$z \bar w = (c + id)(a - ib) = (ac + bd) + i(ad - bc)$$

from which we easily infer
o $z \bar z = x^2 + y^2 = |z|^2$ for $z = x + iy$,
o $\Re(z \bar w)$ is the scalar product of z and w in $\mathbb{R}^2$; in particular z and w are
perpendicular if and only if $\Re(z \bar w) = 0$,
o the area of the triangle T with vertices 0, z and w is

$$\text{Area}(T) = \frac{1}{2}\,|ad - bc| = \frac{1}{2}\,|\Im(z \bar w)|.$$

For the last claim, denote by $\varphi$ the angle between the segments 0z and 0w
at the origin, and recall that $ac + bd = |z|\,|w| \cos\varphi$ and that
$\text{Area}(T) = \frac{1}{2}\,|z|\,|w|\,|\sin\varphi|$. Therefore

$$4\,\text{Area}(T)^2 = |z|^2 |w|^2 (1 - \cos^2\varphi) = (c^2 + d^2)(a^2 + b^2) - (ac + bd)^2 = \dots = (ad - bc)^2. \qquad (4.1)$$
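
The three consequences of the Hermitian product are easy to check numerically with Python's built-in `complex` type. The helper names below (`hermitian`, `scalar_product`, `triangle_area`) are hypothetical and serve only as an illustration.

```python
def hermitian(z: complex, w: complex) -> complex:
    """Hermitian product of z and w, that is z times the conjugate of w."""
    return z * w.conjugate()


def scalar_product(z: complex, w: complex) -> float:
    """Scalar product of z and w seen as vectors of the plane."""
    return hermitian(z, w).real


def triangle_area(z: complex, w: complex) -> float:
    """Area of the triangle with vertices 0, z and w."""
    return abs(hermitian(z, w).imag) / 2
```

For z = 3 and w = 4i, which are perpendicular, the scalar product vanishes and the area of the right triangle with legs 3 and 4 is returned.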



4.7 Polar form of complex numbers: Argand’s plane. A complex
number $z = a + ib \in \mathbb{C}$, $z \neq 0$, can be represented in polar coordinates
$(r, \theta)$ with center at the origin, r being the modulus of z and $\theta$ the angle
between the real positive axis and the half-line from the origin through z,
“measured counterclockwise and in radians,” that is,

$$a = |z| \cos\theta, \qquad b = |z| \sin\theta \qquad (4.2)$$

i.e.,

$$z = |z|(\cos\theta + i \sin\theta). \qquad (4.3)$$

Notice that the previous equality holds for all $z \in \mathbb{C}$. It is the polar representation
of z. The number $\theta$ is called the argument or phase of z. Clearly,
$\theta$ is determined by z up to an integer multiple of $2\pi$, in particular $\theta(1) = 0$
modulo $2\pi$. The argument of z, denoted by Arg(z), must be understood
as a multivalued function and not as a real-valued function. If we insist on
considering the argument of z as a function from $\mathbb{C} \setminus \{0\}$ to $\mathbb{R}$, we have to
choose a determination: that is, an interval $[\alpha, \alpha + 2\pi[$ where a unique value
of the angle must be read. The restriction of the argument to this interval,
called a determination of the argument, is denoted by $\arg^{(\alpha)}(z)$. Among
all determinations, two standard choices are $\alpha = 0$, that is $\theta \in [0, 2\pi[$,
often called the principal determination, commonly denoted by arg z, and
$\alpha = -\pi$, that is $\theta \in [-\pi, \pi[$.

Figure 4.5. Multiplication of complex numbers.

However, choosing a determination has drawbacks: first we have a discontinuity
of the argument function $\arg^{(\alpha)}(z)$ along the half-line through
the origin that forms an angle $\alpha$ with the positive x-axis, where a jump
of $2\pi$ between the values of the two sides of the half-line exists; secondly,
addition formulas just do not hold for a determination, we in fact have

$$\arg^{(\alpha)}(z_1 z_2) = \arg^{(\alpha)} z_1 + \arg^{(\alpha)} z_2 - c$$

where

$$c = \begin{cases} 0 & \text{if } 2\alpha \le \arg^{(\alpha)} z_1 + \arg^{(\alpha)} z_2 < 2\alpha + 2\pi, \\ 2\pi & \text{if } \arg^{(\alpha)} z_1 + \arg^{(\alpha)} z_2 \ge 2\alpha + 2\pi. \end{cases}$$
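
The failure of additivity for a determination can be observed numerically. The sketch below is an illustration under stated assumptions: the names `arg_alpha` and `correction` are hypothetical, and the determination is obtained by shifting the value of `cmath.phase` into $[\alpha, \alpha + 2\pi[$. For $\alpha = 0$ the correction computed this way takes exactly the two values 0 and $2\pi$ of the case split; for other $\alpha$ the sketch simply returns whichever multiple of $2\pi$ brings the sum back into $[\alpha, \alpha + 2\pi[$.

```python
import cmath
import math

TWO_PI = 2 * math.pi


def arg_alpha(z: complex, alpha: float) -> float:
    """Determination of the argument of z != 0 with values in [alpha, alpha + 2*pi)."""
    if z == 0:
        raise ValueError("the argument of 0 is undefined")
    return alpha + (cmath.phase(z) - alpha) % TWO_PI


def correction(z1: complex, z2: complex, alpha: float) -> float:
    """The constant c in arg(z1*z2) = arg(z1) + arg(z2) - c for this determination."""
    return arg_alpha(z1, alpha) + arg_alpha(z2, alpha) - arg_alpha(z1 * z2, alpha)
```

For example, with $\alpha = 0$ and $z_1 = z_2 = -i$ each argument is $3\pi/2$, the product is $-1$ with argument $\pi$, and the correction is $2\pi$.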



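4.8 Show that

$$\theta(z) = \begin{cases} \arctan\dfrac{y}{x} & \text{if } x > 0, \\[4pt] \dfrac{\pi}{2} & \text{if } x = 0 \text{ and } y > 0, \\[4pt] \arctan\dfrac{y}{x} + \pi & \text{if } x < 0 \text{ and } y > 0, \\[4pt] \arctan\dfrac{y}{x} - \pi & \text{if } x < 0 \text{ and } y \le 0, \\[4pt] -\dfrac{\pi}{2} & \text{if } x = 0 \text{ and } y < 0, \end{cases}$$

where $z = x + iy$, is the determination of the argument on $[-\pi, \pi[$.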
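
The case formula can be transcribed directly and compared with the library function `math.atan2`. This is only a sketch (the name `theta` is hypothetical): note that `math.atan2` takes values in $(-\pi, \pi]$, so the two agree everywhere except on the negative real axis, where the determination on $[-\pi, \pi[$ gives $-\pi$.

```python
import math


def theta(x: float, y: float) -> float:
    """Determination of the argument of x + iy with values in [-pi, pi)."""
    if x > 0:
        return math.atan(y / x)
    if x == 0 and y > 0:
        return math.pi / 2
    if x == 0 and y < 0:
        return -math.pi / 2
    if x < 0 and y > 0:
        return math.atan(y / x) + math.pi
    if x < 0 and y <= 0:
        return math.atan(y / x) - math.pi
    raise ValueError("the argument of 0 is undefined")
```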



4.9 Multiplication in polar coordinates. If $z = \rho(\cos\theta + i \sin\theta)$ and
$w = r(\cos\eta + i \sin\eta)$, on account of the addition formulas for the trigonometric
functions and of the rule of multiplication for complex numbers, we
get

$$zw = \rho r\,[(\cos\theta \cos\eta - \sin\theta \sin\eta) + i(\cos\theta \sin\eta + \cos\eta \sin\theta)] = \rho r\,[\cos(\theta + \eta) + i \sin(\theta + \eta)]. \qquad (4.4)$$

That is, the modulus of the product is the product of moduli, while the
argument of the product is the sum of the arguments of the factors. Geometrically,
multiplying a vector $z \in \mathbb{C}$ by $w := |w|(\cos\eta + i \sin\eta)$ means
dilating the vector by a factor $|w|$ and rotating it anticlockwise through
an angle $\eta$. For instance iz is the anticlockwise rotation of z by 90 degrees.
Thus dilations and rotations for plane geometry can all be expressed by
complex multiplication, a useful fact in plane geometry (see, for example,
Section 4.3.2).
4.10 de Moivre’s formula. A trivial consequence of (4.4) is that

$$z^2 = \rho^2 (\cos 2\theta + i \sin 2\theta)$$

if $z = \rho(\cos\theta + i \sin\theta)$, and, proceeding by induction, the following formula,
de Moivre’s formula, holds for every $n \in \mathbb{N}$,

$$z^n = \rho^n (\cos n\theta + i \sin n\theta). \qquad (4.5)$$
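
De Moivre's formula can be compared numerically with repeated multiplication. The sketch below is only a sanity check with a hypothetical function name `de_moivre`, and agreement is expected up to floating-point rounding.

```python
import math


def de_moivre(rho: float, theta: float, n: int) -> complex:
    """Right-hand side of (4.5): rho^n (cos(n*theta) + i sin(n*theta))."""
    return complex(rho ** n * math.cos(n * theta),
                   rho ** n * math.sin(n * theta))
```

For instance, with $\rho = 1.5$, $\theta = 0.7$ and $n = 5$ the value returned agrees with `complex(rho * math.cos(theta), rho * math.sin(theta)) ** 5` up to rounding.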

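4.11 Complex exponential. Set $f(\theta) := \cos\theta + i \sin\theta$, $\theta \in \mathbb{R}$. The
multiplication rule yields the formula

$$f(\theta_1) f(\theta_2) = f(\theta_1 + \theta_2) \quad \forall \theta_1, \theta_2 \in \mathbb{R},$$

which is analogous to $a^x a^y = a^{x+y}$. We then define the complex exponential
as the map $z \mapsto e^z$, $\mathbb{C} \to \mathbb{C}$, also denoted by exp, given by

$$e^z = \exp(z) := e^{\Re z} (\cos \Im z + i \sin \Im z), \qquad (4.6)$$

e being Euler’s number. It is readily seen that

$$e^{z + w} = e^z e^w \quad \forall z, w \in \mathbb{C}.$$

Clearly, if z is real, the complex and real exponential (with base e) agree;
the novelty is in the definition

$$e^{i\theta} = \cos\theta + i \sin\theta, \qquad (4.7)$$

which allows the use of an exponential notation for the trigonometric functions
$\sin\theta$ and $\cos\theta$. Notice the very famous Euler identity

$$e^{i\pi} = -1.$$

Actually, observing that for all $\theta \in \mathbb{R}$ we have

$$e^{i\theta} = \cos\theta + i \sin\theta, \qquad e^{-i\theta} = \cos\theta - i \sin\theta,$$

we easily infer the following Euler’s formulas:

$$\cos\theta = \frac{e^{i\theta} + e^{-i\theta}}{2}, \qquad \sin\theta = \frac{e^{i\theta} - e^{-i\theta}}{2i}.$$

Finally, observe that (4.7) allows us to write any complex number in the
shorter polar form

$$z = |z|\, e^{i \arg^{(\alpha)}(z)} \quad \forall \alpha \in \mathbb{R},\ \forall z \in \mathbb{C},\ z \neq 0.$$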
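
Definition (4.6) translates into a one-line function that can be compared with the library routine `cmath.exp`. The name `complex_exp` is hypothetical; the sketch only illustrates the definition and Euler's identity up to rounding errors.

```python
import cmath
import math


def complex_exp(z: complex) -> complex:
    """e^z := e^(Re z) (cos(Im z) + i sin(Im z)), as in (4.6)."""
    return math.exp(z.real) * complex(math.cos(z.imag), math.sin(z.imag))
```

Evaluating `complex_exp(1j * math.pi)` returns a number within rounding error of −1, and `complex_exp(z + w)` agrees with `complex_exp(z) * complex_exp(w)` in the same sense.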
b. The n-th roots

4.12 Proposition. Let $w \in \mathbb{C}$, $w \neq 0$, $n \in \mathbb{N}$, $n \ge 1$. The equation
$z^n = w$ has exactly n distinct roots $z_0, z_1, \dots, z_{n-1}$ given by

$$z_k := |w|^{1/n} \exp\Big( i\, \frac{\arg(w) + 2k\pi}{n} \Big), \qquad k = 0, 1, \dots, n - 1. \qquad (4.8)$$

Proof. If z is a root of $z^n = w$, then

$$|z|^n = |w| \quad \text{and} \quad \text{Arg}\, z^n = \text{Arg}\, w.$$

The first equality yields $|z| = |w|^{1/n}$, while the second yields

$$\text{Arg}\, z = \frac{\text{Arg}\, w + 2k\pi}{n}, \qquad k \in \mathbb{Z},$$

as $\text{Arg}\, z^n = n\, \text{Arg}\, z$. Therefore

$$z = |w|^{1/n} \exp\Big( i\, \frac{\arg(w) + 2k\pi}{n} \Big) \quad \text{for some } k \in \mathbb{Z}.$$

Since the values $(\arg(w) + 2k\pi)/n$ differ by multiples of $2\pi/n$ and repeat modulo $2\pi$
when k increases by n, the only distinct values modulo $2\pi$ correspond to $k = 0, 1, \dots, n - 1$. Thus
we conclude that z ought to be one of the $z_k$’s. Finally, one checks that all
the $z_k$’s are solutions of $z^n = w$. □

The n distinct solutions $z_0, z_1, \dots, z_{n-1}$ of the equation $z^n = w$ in (4.8)
are called the complex or algebraic n-th roots of w.

Proposition 4.12 applies also to $w = a \in \mathbb{R}$. If a > 0, we have $a = |a| = |a| \exp(i \cdot 0)$, thus

$$z_k = a^{1/n} \exp\Big( i\, \frac{2\pi k}{n} \Big), \qquad k = 0, 1, \dots, n - 1.$$

If a < 0, we have $a = |a| \exp(i\pi)$, hence

$$z_k = |a|^{1/n} \exp\Big( i\, \frac{\pi + 2k\pi}{n} \Big), \qquad k = 0, 1, \dots, n - 1.$$

In particular, for a > 0, we rediscover the arithmetic root $a^{1/n}$ corresponding
to k = 0, and, in case n is even, for k = n/2 we also find

$$a^{1/n} (\cos\pi + i \sin\pi) = -a^{1/n}$$

as n-th root of a, that is the negative real n-th root of a. If a < 0 and n is
odd, n = 2h + 1, then for k = h, $|a|^{1/n} (\cos\pi + i \sin\pi) = -|a|^{1/n}$ is
one of the complex roots and is the only real n-th root of a.

Notice that for every $k = 1, \dots, n - 1$ the argument of $z_k$ is the argument
of $z_{k-1}$ plus $2\pi/n$. Consequently, the n-th roots of a complex number w
represent a regular n-sided polygon, inscribed in a circle centered at 0, with radius
$|w|^{1/n}$ and one vertex at $z_0 = |w|^{1/n} \exp(i \arg(w)/n)$.

Figure 4.6. The 5th roots of a complex number.
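
Formula (4.8) can be turned into a few lines of code. The sketch below is an illustration only: the name `nth_roots` is hypothetical, and the determination of the argument used is the one returned by `cmath.phase`, with values in $(-\pi, \pi]$; any other determination would list the same roots in a different order.

```python
import cmath
import math


def nth_roots(w: complex, n: int) -> list[complex]:
    """The n distinct complex n-th roots of w != 0, following (4.8)."""
    if w == 0 or n < 1:
        raise ValueError("need w != 0 and n >= 1")
    radius = abs(w) ** (1 / n)
    phi = cmath.phase(w)
    return [radius * cmath.exp(1j * (phi + 2 * math.pi * k) / n)
            for k in range(n)]
```

For w = −8 and n = 3 the list contains the only real cube root, −2, together with two complex conjugate roots, all on the circle of radius 2.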



4.13 Roots of unity. In particular Proposition 4.12 yields that the n
solutions of $z^n = 1$ are given by

$$\omega_{n,k} := \exp\Big( i\, \frac{2\pi k}{n} \Big), \qquad k = 0, \dots, n - 1.$$

Let $\omega := \omega_{n,1} = e^{i 2\pi/n}$. Then the n-th roots of unity are the numbers

$$\omega, \omega^2, \dots, \omega^n. \qquad (4.9)$$

Notice that obviously $\omega^n = 1$. Moreover, comparing with (4.8), if $z_1 :=
|w|^{1/n} \exp(i \arg(w)/n)$, then the n-th roots of w can be written as

$$z_1,\ z_1 \omega,\ z_1 \omega^2, \dots,\ z_1 \omega^{n-1}. \qquad (4.10)$$


c. Complex exponential and logarithm 

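The exponential function $z \mapsto e^z$ is actually a map from $\mathbb{C}$ into $\mathbb{C}$. Observe
that $e^z \neq 0$ everywhere since $|e^z| = e^{\Re z} \neq 0$.

4.14 Proposition. The complex exponential is periodic with period $2\pi i$,
i.e.,

$$\exp(z + 2\pi i) = \exp(z) \quad \forall z \in \mathbb{C}.$$

Moreover for any $\alpha \in \mathbb{R}$, the restriction of the exponential map to

$$I_\alpha := \{ z \in \mathbb{C} \mid \alpha \le \Im z < \alpha + 2\pi \}$$

is a bijective map onto $\mathbb{C} \setminus \{0\}$.

Proof. Formula (4.7) yields

$$\exp(i(y + 2k\pi)) = \exp(iy) \quad \forall y \in \mathbb{R},\ \forall k \in \mathbb{Z},$$

that is, $e^z$ is $2\pi i$-periodic. Fix $\alpha \in \mathbb{R}$ and assume that $e^{z_1} = e^{z_2}$, i.e.,
$e^{z_1 - z_2} = 1$. Then $z_1 - z_2 = 2k\pi i$ for some $k \in \mathbb{Z}$, from which we infer $z_1 = z_2$, since
$z_1, z_2 \in I_\alpha$. □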
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Given z £ C, z ^ 0, every w E C such that = z is called a natural 
logarithm of 2 ;. More precisely, 

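4.15 Definition. For any $a \in \mathbb{R}$, the inverse function of the restriction of the complex exponential to

\[ I_a := \{ z \mid a \le \Im z < a + 2\pi \} \]

is called a determination of the complex logarithm,

\[ \log^{(a)} : \mathbb{C} \setminus \{0\} \to I_a \subset \mathbb{C}. \]

When $a = -\pi$, we denote $\log^{(-\pi)} w$ by $\log w$ and call it the principal logarithm.

By definition

\[ e^{\log^{(a)} w} = w \qquad \forall w \in \mathbb{C} \setminus \{0\} \]

and

\[ \log^{(a)}(e^z) = z \quad \text{if and only if } z \in I_a. \]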

4.16 Proposition. For any $a \in \mathbb{R}$ and $w \in \mathbb{C} \setminus \{0\}$ we have

\[ \log^{(a)} w = \log|w| + i\,\arg^{(a)} w. \tag{4.11} \]

Proof. Let $z := x + iy = \log^{(a)} w$. Then

\[ w = e^z, \qquad a \le y < a + 2\pi \]

if and only if

\[ |w| = e^x, \qquad e^{iy} = \frac{w}{|w|}, \qquad a \le y < a + 2\pi, \]

from which we infer $x = \log|w|$ and $y = \arg^{(a)} w$. □

4.17 Example. Since $i = \cos\frac{\pi}{2} + i\sin\frac{\pi}{2}$, i.e., $\arg i = \frac{\pi}{2}$, we have $\log i = i\frac{\pi}{2}$, or $i^i = e^{-\pi/2}$.

On account of Proposition 4.16, all determinations of the logarithm are discontinuous, as are the corresponding determinations of the argument. In particular $\log^{(a)} w$ is singular along the half-line that forms an angle $a$ with the positive $x$-axis, with a jump of $2\pi i$ across this half-line. Also, some care is necessary when computing with logarithms, since the argument of a product is not in general the sum of the arguments. In fact, if $z, w \in \mathbb{C} \setminus \{0\}$, we have

\[ \log^{(a)}(zw) = \log^{(a)} z + \log^{(a)} w - ic, \]

where $c$ is the integer multiple of $2\pi$ for which $\arg^{(a)}(z) + \arg^{(a)}(w) - c$ lies in $[a, a + 2\pi)$. For instance, for $a = 0$,

\[ c = \begin{cases} 0 & \text{if } \arg(z) + \arg(w) < 2\pi, \\ 2\pi & \text{if } \arg(z) + \arg(w) \ge 2\pi. \end{cases} \]

Figure 4.7. The frontispieces of a treatise on functions of complex variables by G. F. Bernhard Riemann (1826-1866) and of a popular book about $\sqrt{-1}$.
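The role of the determination in formula (4.11), and the correction term that appears in the logarithm of a product, can be explored numerically. The sketch below is only an illustration: the names `arg_det` and `log_det` are hypothetical, and the determination interval is assumed to be $[a, a + 2\pi)$.

```python
import cmath
import math

def arg_det(w: complex, a: float) -> float:
    """Argument of w != 0 chosen in the interval [a, a + 2*pi)."""
    theta = cmath.phase(w)                    # some argument of w
    return a + (theta - a) % (2 * math.pi)    # shift it by a multiple of 2*pi

def log_det(w: complex, a: float) -> complex:
    """Determination log^(a) w = log|w| + i*arg^(a) w, as in (4.11)."""
    return complex(math.log(abs(w)), arg_det(w, a))

# With a = 0: arg(-i) = 3*pi/2, so arg(z) + arg(w) = 3*pi >= 2*pi and c = 2*pi.
z = w = -1j
correction = log_det(z, 0.0) + log_det(w, 0.0) - log_det(z * w, 0.0)

# Principal logarithm (a = -pi): i**i = exp(i * log i) = exp(-pi/2).
i_to_the_i = cmath.exp(1j * log_det(1j, -math.pi))
```

Here `correction` equals $2\pi i$ up to rounding, and `i_to_the_i` reproduces the value of Example 4.17.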



4.2 Sequences of Complex Numbers 

a. Definitions 

The limit of a sequence of complex numbers is defined similarly to the real 
case. 

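4.18 Definition. Let $\{z_n\} \subset \mathbb{C}$ be a sequence of complex numbers. We say that $\{z_n\}$ converges to the complex number $z_0$ if $|z_n - z_0| \to 0$, that is

\[ \forall \epsilon > 0 \ \exists\, \bar n \ \text{such that} \ |z_n - z_0| < \epsilon \quad \forall n \ge \bar n. \]

We say that $\{z_n\} \subset \mathbb{C}$ diverges if $|z_n| \to +\infty$ as a sequence of real numbers.

As in the real case

4.19 Definition. We say that a sequence $\{z_n\} \subset \mathbb{C}$ is a Cauchy sequence if $\forall \epsilon > 0$ there is $\bar n$ such that $|z_n - z_m| < \epsilon$ for all $n, m \ge \bar n$.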

From the inequalities

\[ |x|,\ |y| \le |z| = \sqrt{x^2 + y^2} \le |x| + |y| \tag{4.12} \]

for all $z = x + iy \in \mathbb{C}$, we easily infer

4.20 Proposition. Let $\{z_n\}$, $z_n := x_n + iy_n$, be a sequence of complex numbers, and let $z_0 := x_0 + iy_0 \in \mathbb{C}$. Then

(i) $z_n \to z_0$ in $\mathbb{C}$ if and only if $x_n \to x_0$ and $y_n \to y_0$,

(ii) $\{z_n\}$ is a Cauchy sequence in $\mathbb{C}$ if and only if $\{x_n\}$ and $\{y_n\}$ are Cauchy sequences in $\mathbb{R}$.

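From Proposition 4.20 we easily infer for instance

4.21 Proposition. Let $\{z_n\}$ and $\{w_n\}$ be two sequences of complex numbers such that $z_n \to z \in \mathbb{C}$ and $w_n \to w \in \mathbb{C}$. Then

(i) $\lambda z_n + \mu w_n \to \lambda z + \mu w$ for all $\lambda, \mu \in \mathbb{C}$,

(ii) $z_n w_n \to zw$,

(iii) if $w_n \neq 0$ and $w \neq 0$, then $z_n/w_n \to z/w$,

(iv) $|z_n| \to |z|$, and $\bar z_n \to \bar z$.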

4.22 Prove Propositions 4.20 and 4.21. 

Finally, we have the following. 

4.23 Theorem (Cauchy test). A sequence of complex numbers con- 
verges if and only if it is a Cauchy sequence. 

This follows again from Proposition 4.20 if we take into account the Cauchy test for real sequences. For the same reason the Bolzano-Weierstrass theorem, Theorem 2.43, extends to complex sequences.

4.24 Theorem (Bolzano-Weierstrass). Any bounded sequence of com- 
plex numbers has a convergent subsequence. 



b. Weierstrass’s theorem 

At this point we could introduce the notions of limit and of continuity for functions of a complex variable and develop a theory similar to the real case. Instead, we prefer to postpone this study to the context of metric spaces. Here we confine ourselves to defining continuity for functions $f : \mathbb{C} \to \mathbb{R}$. A function $f : \mathbb{C} \to \mathbb{R}$ is said to be continuous at $z_0$ if for every sequence $\{z_n\}$ converging to $z_0$ we have $f(z_n) \to f(z_0)$; and $f$ is continuous if it is continuous at each $z_0 \in \mathbb{C}$. Finally we say that $f(z) \to +\infty$ as $|z| \to +\infty$ if $\forall M > 0$ there exists $r > 0$ such that $f(z) > M$ for all $z$ such that $|z| > r$.

4.25 Theorem (Weierstrass). Let $f : \mathbb{C} \to \mathbb{R}$ be a continuous function such that $f(z) \to +\infty$ as $|z| \to \infty$. Then $f$ attains its minimum at a point $z_0 \in \mathbb{C}$.

Proof. Let $L := \inf\{f(z) \mid z \in \mathbb{C}\}$. It suffices to show that $f(z_0) = L$ for some $z_0 \in \mathbb{C}$. From the characterization of the infimum we infer $-\infty \le L < +\infty$, the existence of a minimizing sequence $\{y_n\} \subset f(\mathbb{C}) \subset \mathbb{R}$ such that $y_n \to L$ and of a sequence $\{z_k\} \subset \mathbb{C}$ such that $f(z_k) = y_k$. The sequence $\{z_k\}$ is bounded, otherwise we could find a subsequence $\{z_{n_k}\}$ of $\{z_k\}$ with $|z_{n_k}| \to \infty$, hence $f(z_{n_k}) \to +\infty$. Since $y_{n_k} = f(z_{n_k}) \to L$, we would get $L = +\infty$: a contradiction. The Bolzano-Weierstrass theorem, Theorem 4.24, then yields a subsequence $\{z_{n_k}\}$ which converges to some $z_0 \in \mathbb{C}$. We then have $y_{n_k} = f(z_{n_k}) \to f(z_0)$. Since by construction $y_n \to L$, we conclude $L = f(z_0)$, as wanted. □



4.3 Some Elementary Applications 

In this section we present a few elementary applications. 



4.3.1 A few applications of the complex 
notation 

When dealing with trigonometric formulas, but actually in many instances, 
the complex notation simplifies computations a great deal. 



4.26 Uniform circular motion. Recall that the harmonic motion of a point $P(t)$ on the unit circle of $\mathbb{R}^2$, that starts at $t = 0$ from $(1, 0)$ with angular velocity $\omega$, is given by

\[ x(t) = \cos\omega t, \qquad y(t) = \sin\omega t, \]

see, e.g., Proposition 6.25 of [GM1]. Thus, introducing complex notation, the uniform circular motion on the unit circle is described by the function $P : \mathbb{R} \to \mathbb{C}$ given by $P(t) = e^{i\omega t}$. This formula already appears as a great simplification of the description of the harmonic motion on the unit circle.

However the simplification that can be obtained using complex numbers and complex notation is even more evident if one notices that it is easier to compute with powers than with sine and cosine. For instance, if we define for $z : \mathbb{R} \to \mathbb{C}$, $z(t) = x(t) + iy(t)$,

\[ z'(t) = Dz(t) := x'(t) + iy'(t), \qquad \int_0^t z(s)\,ds := \int_0^t x(s)\,ds + i\int_0^t y(s)\,ds, \]

then we have

\[ D(e^{i\omega t}) = i\omega\,e^{i\omega t}, \qquad \int_0^t e^{i\omega s}\,ds = \frac{e^{i\omega t} - 1}{i\omega} \]

for all $\omega \in \mathbb{R}$ (with $\omega \neq 0$ in the second formula). Clearly these formulas are handier than $D(e^{at}\cos(bt)) = ae^{at}\cos(bt) - be^{at}\sin(bt)$ and $D(e^{at}\sin(bt)) = ae^{at}\sin(bt) + be^{at}\cos(bt)$, or the corresponding formulas for the primitives.

4.27 Example. Euler’s formulas yield that the sine and cosine functions are just a superposition of two uniform circular motions with opposite angular velocities of $\pm 1$.

4.28 Complex solutions of the oscillation equation. Consider the differential equation

\[ ax''(t) + bx'(t) + cx(t) = 0 \]

and look for solutions $x : \mathbb{R} \to \mathbb{C}$. The characteristic equation

\[ a\lambda^2 + b\lambda + c = 0 \]

has two roots $\lambda_1$ and $\lambda_2$ that are either distinct or equal. In the first case, $\lambda_1 \neq \lambda_2$, $e^{\lambda_1 t}$ and $e^{\lambda_2 t}$ are solutions and, on account of the principle of superposition,

\[ c_1e^{\lambda_1 t} + c_2e^{\lambda_2 t}, \qquad c_1, c_2 \in \mathbb{C}, \]

are solutions, too. In the second case, $\lambda_1 = \lambda_2 =: \lambda$, the functions $e^{\lambda t}$ and $te^{\lambda t}$ are solutions as well as all functions of the type

\[ (c_1 + c_2t)e^{\lambda t}, \qquad c_1, c_2 \in \mathbb{C}. \]

Exactly as in the real case (see, e.g., Section 6.1.3 of [GM1]), one can then conclude that, in fact, these are all solutions.
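The two cases above can be checked on concrete coefficients. The following sketch is illustrative only: the helper names `characteristic_roots` and `residual` are hypothetical, and the derivatives of $e^{\lambda t}$ and $te^{\lambda t}$ are written out by hand rather than computed symbolically.

```python
import cmath

def characteristic_roots(a: complex, b: complex, c: complex):
    """Roots of a*lam**2 + b*lam + c = 0 (a != 0) via the quadratic formula."""
    disc = cmath.sqrt(b * b - 4 * a * c)
    return (-b + disc) / (2 * a), (-b - disc) / (2 * a)

def residual(a, b, c, lam, t, repeated=False):
    """Value of a*x'' + b*x' + c*x at time t.

    x(t) = exp(lam*t) by default, or x(t) = t*exp(lam*t) if repeated is True.
    """
    e = cmath.exp(lam * t)
    if repeated:
        x, dx, ddx = t * e, (1 + lam * t) * e, (2 * lam + lam * lam * t) * e
    else:
        x, dx, ddx = e, lam * e, lam * lam * e
    return a * ddx + b * dx + c * x

# Distinct roots: x'' + 2x' + 5x = 0 has lambda = -1 +/- 2i.
lam1, lam2 = characteristic_roots(1, 2, 5)
# Double root: x'' + 2x' + x = 0 has lambda = -1, and t*exp(-t) is a solution.
lam_double, _ = characteristic_roots(1, 2, 1)
```

Evaluating `residual` at any time `t` for these roots returns zero up to rounding, in agreement with the claim that the exponentials solve the equation.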



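4.29 Prostapheresis formulas. In complex notation they are written as

\[ e^{i\alpha} + e^{i\beta} = 2\cos\frac{\alpha - \beta}{2}\,e^{i\frac{\alpha+\beta}{2}}, \qquad e^{i\alpha} - e^{i\beta} = 2i\sin\frac{\alpha - \beta}{2}\,e^{i\frac{\alpha+\beta}{2}}, \tag{4.13} \]

and can be easily deduced. In fact, writing

\[ \alpha = \frac{\alpha+\beta}{2} + \frac{\alpha-\beta}{2}, \qquad \beta = \frac{\alpha+\beta}{2} - \frac{\alpha-\beta}{2}, \]

it suffices to note that

\[ e^{i\alpha} + e^{i\beta} = e^{i\frac{\alpha+\beta}{2}}\big\{e^{i\frac{\alpha-\beta}{2}} + e^{-i\frac{\alpha-\beta}{2}}\big\}, \qquad e^{i\alpha} - e^{i\beta} = e^{i\frac{\alpha+\beta}{2}}\big\{e^{i\frac{\alpha-\beta}{2}} - e^{-i\frac{\alpha-\beta}{2}}\big\}. \]

Therefore they are a trivial consequence of Euler’s formulas.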



4.30 Beating phenomenon. This is a phenomenon which occurs when we sum two sinusoids of slightly different pulses, compare, e.g., 6.13 of [GM1]. As a simple example, let us show that the same phenomenon appears when we sum sinusoidal signals with different amplitudes. If $f_1(t) = A_1\cos(\omega_1 t + \varphi_1)$, $f_2(t) = A_2\cos(\omega_2 t + \varphi_2)$, we have

\[ f_1(t) = \Re(c_1e^{i\omega_1 t}), \qquad f_2(t) = \Re(c_2e^{i\omega_2 t}) \]

where $c_1 = A_1e^{i\varphi_1}$ and $c_2 = A_2e^{i\varphi_2}$. Hence

\[ f_1(t) + f_2(t) = \Re\big(c_1e^{i\omega_1 t} + c_2e^{i\omega_2 t}\big). \]

Factoring out the mean oscillation we then find

\[ c_1e^{i\omega_1 t} + c_2e^{i\omega_2 t} = e^{i\frac{\omega_1+\omega_2}{2}t}\Big\{c_1e^{i\frac{\omega_1-\omega_2}{2}t} + c_2e^{-i\frac{\omega_1-\omega_2}{2}t}\Big\}. \tag{4.14} \]

The explicit computation of the real part is of course complicated, but, without performing such a computation, we see that we are in the presence of a signal with pulse $(\omega_1+\omega_2)/2$ and amplitude varying periodically with pulse $(\omega_1-\omega_2)/2$.



4.3.2 A few applications to elementary
Euclidean geometry 

Translations, dilations and rotations are the typical transformations of 
Euclidean geometry of the plane. As we have seen, after introducing an 
orthonormal reference frame, they have a natural algebraic counterpart in 
the operation of sum and product in the Gauss plane. Therefore it is not 
surprising that the use of complex numbers permits a particularly simple 
algebrization of the geometry in the plane. 



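4.31 Straight line through two points $a, b \in \mathbb{C}$. The point $z \in \mathbb{C}$ is on the line through $a$ and $b$ if and only if $z - a$ is a real multiple of $b - a$, equivalently if and only if $(z - a)/(b - a)$ is real, that is

\[ \frac{z - a}{b - a} = \overline{\Big(\frac{z - a}{b - a}\Big)}, \qquad \text{or} \qquad \Im\Big(\frac{z - a}{b - a}\Big) = 0. \]

Consequently the two open half-planes bounded by that line are described by $\Im\big((z - a)/(b - a)\big) > 0$ and $\Im\big((z - a)/(b - a)\big) < 0$.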

4.32 Perpendicular lines. Let $a, b, c$ be three distinct points in the plane. Since an anticlockwise rotation by 90 degrees translates into a multiplication by $i$, the lines through $a$ and $b$ and through $a$ and $c$ are perpendicular if and only if $(c - a)/(b - a)$ is purely imaginary, that is

\[ \frac{c - a}{b - a} = -\overline{\Big(\frac{c - a}{b - a}\Big)}, \qquad \text{or} \qquad \Re\big((c - a)\overline{(b - a)}\big) = 0. \]

Consequently the line through $a$ and perpendicular to the line through $a$ and $b$ is

\[ \big\{ z \in \mathbb{C} \ \big|\ \overline{(b - a)}(z - a) + (b - a)\overline{(z - a)} = 0 \big\}. \]

4.33 Similarity of two triangles. Let $z_1, z_2, z_3 \in \mathbb{C}$ and $w_1, w_2, w_3 \in \mathbb{C}$ be the vertices of two triangles $S$ and $T$. We recall that $S$ and $T$ are similar (with the ordered vertices) if the dilation and rotation which moves $z_2 - z_1$ to $w_2 - w_1$ moves also $z_3 - z_1$ onto $w_3 - w_1$. Since rotations and dilations translate into a complex multiplication, $S$ and $T$ are similar if and only if, for some $b \in \mathbb{C}$, we have $w_2 - w_1 = b(z_2 - z_1)$ and also $w_3 - w_1 = b(z_3 - z_1)$, that is,

\[ \frac{z_3 - z_1}{z_2 - z_1} = \frac{w_3 - w_1}{w_2 - w_1}. \]



a. Special points of a triangle 

4.34 Circumcenter. The perpendicular bisectors of the three sides of an arbitrary triangle meet at a point. It is called the circumcenter of the triangle, and it is the center of the circumcircle of the triangle.

Let $a, b, c \in \mathbb{C}$ be the vertices. The middle points of the three sides are respectively $(a + b)/2$, $(a + c)/2$ and $(b + c)/2$. The equations of the bisectors are consequently

\[ \begin{cases} (\bar b - \bar c)z + (b - c)\bar z = |b|^2 - |c|^2, \\ (\bar c - \bar a)z + (c - a)\bar z = |c|^2 - |a|^2, \\ (\bar a - \bar b)z + (a - b)\bar z = |a|^2 - |b|^2, \end{cases} \]

that, solved in $z$, give for the circumcenter

\[ o = \frac{|a|^2(b - c) + |b|^2(c - a) + |c|^2(a - b)}{\bar a(b - c) + \bar b(c - a) + \bar c(a - b)}. \]
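The closed formula for the circumcenter translates directly into code. The following sketch is illustrative (the function name `circumcenter` is hypothetical); it assumes the vertices are given as Python complex numbers and that the triangle is nondegenerate.

```python
def circumcenter(a: complex, b: complex, c: complex) -> complex:
    """Circumcenter of the triangle with vertices a, b, c in the Gauss plane."""
    num = (abs(a) ** 2 * (b - c)
           + abs(b) ** 2 * (c - a)
           + abs(c) ** 2 * (a - b))
    den = (a.conjugate() * (b - c)
           + b.conjugate() * (c - a)
           + c.conjugate() * (a - b))
    if den == 0:
        raise ValueError("degenerate triangle: the vertices are collinear")
    return num / den

# Right triangle with vertices 0, 2, 2i: the circumcenter is the midpoint
# of the hypotenuse, 1 + i.
o = circumcenter(0j, 2 + 0j, 2j)
```

For any nondegenerate triangle the returned point is equidistant from the three vertices, which is a convenient sanity check.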

4.35 Barycenter or centroid. The three medians (the lines connecting each vertex to the middle point of the opposite side) meet at a point. If $a, b, c$ are the vertices, $(b + c)/2$, $(a + c)/2$ and $(a + b)/2$ are the midpoints of the corresponding opposite sides. The intersection point of two medians can be obtained by solving in $\lambda, \mu \in \mathbb{R}$ the system

\[ \begin{cases} z = \lambda a + (1 - \lambda)\dfrac{b + c}{2}, \\[2mm] z = \mu b + (1 - \mu)\dfrac{a + c}{2}. \end{cases} \]

Subtracting, we easily infer, since the triangle is nondegenerate, that the previous system has one solution given by $\lambda = \mu = 1/3$, thus concluding that the barycenter is

\[ z = \frac{a + b + c}{3}, \]

$z$ belonging also to the third median. Notice that it is easy now to prove that the intersection of the medians is two thirds of the way along each median from its vertex.

4.36 Orthocenter. The altitudes of a triangle (that is, the perpendiculars dropped from the vertices onto the opposite sides) intersect at a single point: the orthocenter. Fix a reference with origin at the circumcenter $o$ of the triangle, and, in this reference, let $a, b, c \in \mathbb{C}$ be the three vertices of the triangle, so that $|a| = |b| = |c|$. We claim that the point

\[ p = a + b + c \]

is the orthocenter. In fact, since $p - a = b + c$, and $|b| = |c|$, $p - a$ is perpendicular to the side $bc$. By symmetry, $p$ is also on the perpendicular from $b$ to $ca$ and from $c$ to $ab$.

Figure 4.8. (a) Euler’s line, (b) The nine-point circle.




4.37 ¶ Incenter. Show that the three angle bisectors of a triangle meet at a point called the incenter.



4.38 Euler’s line. In any triangle, the circumcenter, the orthocenter and the barycenter lie on a straight line. In fact in a reference in which the circumcenter is the origin, the barycenter $m$ and the orthocenter $p$ are respectively

\[ m = \frac13(a + b + c) \qquad \text{and} \qquad p = a + b + c, \]

by 4.35 and 4.36.

We also have the following theorem due to Karl Feuerbach (1800-1834), but proba- 
bly already known to Charles Brianchon (1783-1864) and Jean-Victor Poncelet (1788- 
1867). 



4.39 Theorem (The nine-point circle). Let $a, b, c \in \mathbb{C}$ be the vertices of a triangle that for convenience we think of as inscribed in the unit circle, i.e., $|a| = |b| = |c| = 1$. Let $q$ be the midpoint of the segment connecting the circumcenter and the orthocenter, that is, in the chosen frame,

\[ q = \frac{a + b + c}{2}. \]

Then the circle with center q and radius 1/2 goes through 

(i) the midpoints of the three sides, 

(ii) the midpoints of the segments joining the orthocenter with the three vertices, 

(iii) the feet of the three perpendiculars from the vertices to the opposite sides. 



Proof. It suffices to show that each of those points has distance 1/2 from $q$. The distance of the midpoint of $bc$ from $q$ is

\[ \Big| q - \frac{b + c}{2} \Big| = \frac{|a|}{2} = \frac12. \]

The midpoint of the segment joining the orthocenter $p$ with $a$ is $(a + (a + b + c))/2 = (a + 2q)/2$, hence

\[ \Big| q - \frac{a + 2q}{2} \Big| = \frac{|a|}{2} = \frac12. \]

Finally, one sees that the foot of the perpendicular from $a$ to $bc$ is

\[ \eta = \frac{a + b + c}{2} - \frac{bc}{2a}, \]

hence

\[ |q - \eta| = \Big| \frac{bc}{2a} \Big| = \frac{|b|\,|c|}{2|a|} = \frac12. \]

□

Figure 4.9. Napoleon’s theorem.



b. Equilateral triangles 

4.40 Proposition. Let $z_1, z_2, z_3$ be the vertices of a triangle (listed anticlockwise) and let $\omega := \exp(i2\pi/3)$ be the second of the 3rd roots of unity. Then the triangle $z_1z_2z_3$ is equilateral if and only if

\[ z_1 + \omega z_2 + \omega^2 z_3 = 0. \]

Proof. In fact, $z_1z_2z_3$ is equilateral if and only if it is similar to the triangle of the 3rd roots of unity, $1, \omega, \omega^2$. Then, according to 4.33,

\[ \frac{z_3 - z_1}{z_2 - z_1} = \frac{\omega^2 - 1}{\omega - 1} = \omega + 1, \]

i.e., $z_3 + \omega z_1 - (\omega + 1)z_2 = 0$. Since $\omega^2 + \omega + 1 = 0$, we infer

\[ z_3 + \omega z_1 + \omega^2 z_2 = 0 \]

and, multiplying by $\omega^2$, the conclusion. □



4.41 Napoleon’s theorem. It is said that Napoleon stated and proved the following result. On each side of an arbitrary triangle draw the exterior equilateral triangle. Then the barycenters of these three equilateral triangles are the vertices of a fourth equilateral triangle. In fact, listing the vertices anticlockwise, if $z_1, z_2, z_3$ are the vertices of the triangle and $w_3z_2z_1$, $w_1z_3z_2$, $w_2z_1z_3$ are the exterior equilateral triangles, we have by Proposition 4.40

\[ \begin{cases} z_2 + \omega z_1 + \omega^2 w_3 = 0, \\ w_1 + \omega z_3 + \omega^2 z_2 = 0, \\ z_3 + \omega w_2 + \omega^2 z_1 = 0, \end{cases} \]

and the barycenters of the exterior triangles are given by

\[ \zeta_3 = \frac{z_1 + z_2 + w_3}{3}, \qquad \zeta_1 = \frac{z_2 + z_3 + w_1}{3}, \qquad \zeta_2 = \frac{z_1 + z_3 + w_2}{3}. \]

Therefore

\[ \zeta_1 + \omega\zeta_2 + \omega^2\zeta_3 = \frac13(w_1 + z_2 + z_3) + \frac{\omega}{3}(z_3 + w_2 + z_1) + \frac{\omega^2}{3}(z_2 + z_1 + w_3) \]
\[ = \frac13\Big\{ (w_1 + \omega z_3 + \omega^2 z_2) + (z_3 + \omega w_2 + \omega^2 z_1) + (z_2 + \omega z_1 + \omega^2 w_3) \Big\} = 0. \]
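Napoleon’s theorem lends itself to a quick numerical experiment. The sketch below is an illustration under stated assumptions: the function name `napoleon_centroids` is hypothetical, the input triangle is assumed to be listed anticlockwise, and the apexes $w_1, w_2, w_3$ of the exterior triangles are obtained by solving the three relations $z_2 + \omega z_1 + \omega^2 w_3 = 0$, $w_1 + \omega z_3 + \omega^2 z_2 = 0$, $z_3 + \omega w_2 + \omega^2 z_1 = 0$.

```python
import cmath
import math

OMEGA = cmath.exp(2j * math.pi / 3)   # second of the 3rd roots of unity

def napoleon_centroids(z1: complex, z2: complex, z3: complex):
    """Barycenters of the exterior equilateral triangles built on the sides
    of the anticlockwise triangle z1, z2, z3."""
    w3 = -(z2 + OMEGA * z1) / OMEGA ** 2      # from z2 + omega*z1 + omega^2*w3 = 0
    w1 = -(OMEGA * z3 + OMEGA ** 2 * z2)      # from w1 + omega*z3 + omega^2*z2 = 0
    w2 = -(z3 + OMEGA ** 2 * z1) / OMEGA      # from z3 + omega*w2 + omega^2*z1 = 0
    c1 = (z2 + z3 + w1) / 3
    c2 = (z1 + z3 + w2) / 3
    c3 = (z1 + z2 + w3) / 3
    return c1, c2, c3

# A scalene triangle listed anticlockwise.
c1, c2, c3 = napoleon_centroids(0j, 4 + 0j, 1 + 3j)
sides = [abs(c1 - c2), abs(c2 - c3), abs(c3 - c1)]
```

The three entries of `sides` agree up to rounding, and `c1 + OMEGA*c2 + OMEGA**2*c3` vanishes, which is the equilateral criterion of Proposition 4.40.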

The following result, discovered by Frank Morley (1860-1937), is quite surprising 

4.42 Theorem (Morley). The intersections of the adjacent pairs of angle trisectors 
of an arbitrary triangle are the vertices of an equilateral triangle. 

4.43 Lemma. Suppose that $t_1, t_2, t_3, t_4$ are points on the unit circle. Then the extensions of the chords joining the points $t_1, t_2$ and $t_3, t_4$ meet at

\[ z = \frac{t_1^{-1} + t_2^{-1} - t_3^{-1} - t_4^{-1}}{t_1^{-1}t_2^{-1} - t_3^{-1}t_4^{-1}}. \]

4.44 Prove Lemma 4.43. 






Proof of Morley theorem. For the sake of convenience assume that the triangle $ABC$ is inscribed in the unit circle, $A = 1$, $\angle AOB = 3\gamma$, $\angle AOC = 3\beta$, $\beta < 0$, and $\angle BOC = 3\alpha$. Since circumferential angles are half the corresponding central angles, in order to find the intersections of the trisectors of vertex angles it suffices to trisect the corresponding central angles. We then call $B = c^3$ in such a way that the intersections of the trisectors of the angle in $C$ with the circle are the points $c$ and $c^2$. Similarly we set $C = b^3$, $\arg b < 0$, in such a way that the corresponding intersections are $b$ and $b^2$. The arguments of the intersections of the trisectors of the angle in $A$ are

\[ \alpha + 3\gamma = \beta + 2\gamma + \frac{2\pi}{3}, \qquad 2\alpha + 3\gamma = 2\beta + \gamma + \frac{4\pi}{3}; \]

consequently the intersections of the trisectors of the angle $A$ with the circle are the points $\omega bc^2$ and $\omega^2b^2c$ where $\omega := e^{i2\pi/3}$, see Figure 4.11.

Figure 4.11. Intersections of the angle trisectors with the circumscribed circle.

If $P, Q, R$ are the vertices of the triangle obtained by intersecting adjacent trisectors, we compute, on account of Lemma 4.43,

\[ P = \frac{b^{-2} + c^{-3} - b^{-3} - c^{-2}}{b^{-2}c^{-3} - b^{-3}c^{-2}} = \frac{bc^3 + b^3 - c^3 - b^3c}{b - c} = (b^2 + bc + c^2) - bc(b + c), \]

\[ Q = \frac{1 + b^{-2}c^{-1}\omega^{-2} - b^{-3} - c^{-1}}{b^{-2}c^{-1}\omega^{-2} - b^{-3}c^{-1}} = \frac{b^3c + b\omega - c - b^3}{b\omega - 1} = \omega^2\big( c(b^2 + b\omega^2 + \omega) - b(b + \omega^2) \big), \]

\[ R = \frac{1 + b^{-1}c^{-2}\omega^{-1} - b^{-1} - c^{-3}}{b^{-1}c^{-2}\omega^{-1} - b^{-1}c^{-3}} = \frac{bc^3 + c\omega^2 - c^3 - b}{c\omega^2 - 1} = \omega\big( b(c^2 + c\omega + \omega^2) - c(c + \omega) \big). \]

Finally, we infer

\[ P + \omega Q + \omega^2 R = b^2 + bc + c^2 - b^2c - bc^2 + b^2c + bc\omega^2 + c\omega - b^2 - b\omega^2 + bc^2 + bc\omega + b\omega^2 - c^2 - c\omega = bc(1 + \omega + \omega^2) = 0, \]

and the claim follows from Proposition 4.40.
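The computation in the proof can be replayed numerically with the chord-intersection formula of Lemma 4.43. This is a sketch under stated assumptions: the function names are hypothetical, the triangle is $A = 1$, $B = c^3$, $C = b^3$ with $c = e^{i\gamma}$, $b = e^{i\beta}$, $\gamma > 0 > \beta$ and $3\gamma - 3\beta < 2\pi$, and the pairing of trisector endpoints follows the reading of the proof given above.

```python
import cmath
import math

def chord_intersection(t1, t2, t3, t4):
    """Intersection of the lines through t1, t2 and through t3, t4,
    all four points lying on the unit circle (Lemma 4.43)."""
    num = 1 / t1 + 1 / t2 - 1 / t3 - 1 / t4
    den = 1 / (t1 * t2) - 1 / (t3 * t4)
    return num / den

def morley_triangle(gamma: float, beta: float):
    """Vertices P, Q, R of the Morley triangle of A = 1, B = c**3, C = b**3."""
    omega = cmath.exp(2j * math.pi / 3)
    b, c = cmath.exp(1j * beta), cmath.exp(1j * gamma)
    A, B, C = 1, c ** 3, b ** 3
    P = chord_intersection(B, b ** 2, C, c ** 2)               # trisectors next to side BC
    Q = chord_intersection(A, omega ** 2 * b ** 2 * c, C, c)   # trisectors next to side CA
    R = chord_intersection(A, omega * b * c ** 2, B, b)        # trisectors next to side AB
    return P, Q, R

omega = cmath.exp(2j * math.pi / 3)
P, Q, R = morley_triangle(0.5, -0.4)
sides = [abs(P - Q), abs(Q - R), abs(R - P)]
```

For these sample angles the three side lengths coincide up to rounding and $P + \omega Q + \omega^2 R$ vanishes, in agreement with the algebraic identity.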



4.4 Summing Up 

Complex numbers 

Complex numbers are points in the plane $\mathbb{R}^2$: one identifies 1 with $(1, 0)$ and denotes by $i$ the number corresponding to $(0, 1)$. Complex addition then coincides with the sum of plane vectors. Any complex number $z = (x, y)$ is then written as $x + iy$, and complex multiplication reduces to the standard rules plus $i^2 = -1$.

If $z = x + iy \in \mathbb{C}$, then

\[ \Re(z) := x, \qquad \Im(z) := y, \qquad \bar z := x - iy, \qquad |z|^2 := x^2 + y^2 = z\bar z. \]

Polar form

Every complex number $z \neq 0$ can be written in polar form as $z = |z|(\cos\theta + i\sin\theta)$ where $\theta$, which is defined modulo a multiple of $2\pi$, is called the argument of $z$. We have
∘ for $z = |z|(\cos\theta + i\sin\theta)$ and $w = |w|(\cos\varphi + i\sin\varphi)$, $zw = |z|\,|w|(\cos(\theta + \varphi) + i\sin(\theta + \varphi))$.

∘ De Moivre’s formula. $z^n = |z|^n(\cos n\theta + i\sin n\theta)$.

∘ The argument of a complex number $z \neq 0$ is not uniquely defined. In order to consider the argument as a real function $\arg(z) : \mathbb{C} \setminus \{0\} \to \mathbb{R}$, we need to choose a determination, that is an interval of size $2\pi$ in which to read the argument: a common choice is the principal determination $\arg : \mathbb{C} \setminus \{0\} \to [0, 2\pi[$.



The n-th roots 

If $w \in \mathbb{C} \setminus \{0\}$, there are $n$ distinct $n$-th roots of $w$, i.e., $n$ distinct solutions of the equation $z^n = w$, given by

\[ z_j := |w|^{1/n}e^{i\frac{\arg w}{n}}\,\omega^j, \qquad j = 0, 1, \dots, n-1, \]

where $\omega := e^{i2\pi/n}$. The numbers

\[ z_j := \omega^j, \qquad j = 0, 1, \dots, n-1, \]

are the $n$-th roots of unity, i.e., the solutions of $z^n = 1$.



Complex exponential 

Define the complex exponential by 

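\[ e^z := e^x(\cos y + i\sin y), \qquad \text{for all } z = x + iy \in \mathbb{C}. \]

∘ We have

\[ e^0 = 1, \qquad e^{i\pi} = -1, \qquad e^{z+w} = e^ze^w \quad \forall z, w \in \mathbb{C}. \]

∘ Euler’s formulas. If $t \in \mathbb{R}$, then

\[ e^{it} := \cos t + i\sin t, \qquad \cos t = \frac{e^{it} + e^{-it}}{2}, \qquad \sin t = \frac{e^{it} - e^{-it}}{2i}. \]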

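Complex notation appears as a great simplification of the description of the harmonic functions.

∘ The uniform circular motion on the unit circle with angular velocity $\omega$ passing through 1 at $t = 0$ is described by $t \mapsto e^{i\omega t}$, $t \in \mathbb{R}$.
∘ For $\lambda \in \mathbb{C}$ the formulas

\[ D(e^{i\lambda t}) = i\lambda\,e^{i\lambda t}, \qquad \int_0^t e^{i\lambda s}\,ds = \frac{e^{i\lambda t} - 1}{i\lambda}, \quad \lambda \neq 0, \]

are handier than $D(e^{at}\cos(bt)) = ae^{at}\cos(bt) - be^{at}\sin(bt)$ and $D(e^{at}\sin(bt)) = ae^{at}\sin(bt) + be^{at}\cos(bt)$, or the corresponding formulas for the primitives.
∘ Prostapheresis formulas.

\[ e^{i\alpha} + e^{i\beta} = 2\cos\frac{\alpha - \beta}{2}\,e^{i\frac{\alpha+\beta}{2}}, \qquad e^{i\alpha} - e^{i\beta} = 2i\sin\frac{\alpha - \beta}{2}\,e^{i\frac{\alpha+\beta}{2}}. \]

They clearly explain the beating phenomenon between two oscillators, even with different amplitudes:

\[ c_1e^{i\omega_1 t} + c_2e^{i\omega_2 t} = e^{i\frac{\omega_1+\omega_2}{2}t}\Big\{c_1e^{i\frac{\omega_1-\omega_2}{2}t} + c_2e^{-i\frac{\omega_1-\omega_2}{2}t}\Big\}. \]

4.5 Exercises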



4.45 ¶. Write the following complex numbers in polar form:
$5 - 5i$, $1 + i\sqrt{3}$, $2 - 5i$, $1 - i$.



4.46 %. Determine the points in the complex plane such that 



1 

3 ^- = 

2 a 



— 1 

= 0 , 



■ z| + I 2 : H- i| < 4, 



■ zi 



Z — Z2 



= k. 



z + 1 

Describe algebraically the sets 

(i) of the points that have distance at most 1 from the imaginary axis; 

(ii) of the points in the positive half-plane, with distance at least 2 from the origin 



4.47 %, Write in the form a-\- ib the numbers 



2-i 







2 



4.48 K. Compute 

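4.49 ¶. Verify the following:

\[ \cos 3\theta = \cos^3\theta - 3\cos\theta\sin^2\theta, \]
\[ \sin 3\theta = 3\cos^2\theta\sin\theta - \sin^3\theta, \]
\[ \cos 4\theta = \cos^4\theta - 6\cos^2\theta\sin^2\theta + \sin^4\theta = 1 - 8\cos^2\theta\sin^2\theta, \]
\[ \sin 4\theta = 4\cos^3\theta\sin\theta - 4\cos\theta\sin^3\theta. \]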



4.50 Compute 

(l-f-^)^ (3-3^)^ (-5 + 5i)®. 

4.51 de Moivre’s formula allows us to express $\cos n\theta$ and $\sin n\theta$ by means of $\cos\theta$ and $\sin\theta$. Find those formulas. [Hint: Use Newton’s binomial.]

4.52 ¶ Fagnano formula. Show that $2i\log\dfrac{1 - i}{1 + i} = \pi$.



4.53 Infer the following equalities from Euler’s formulas:

\[ \cos^3 x = \frac14\cos 3x + \frac34\cos x, \]
\[ \sin^4 x = \frac18\cos 4x - \frac12\cos 2x + \frac38, \]
\[ \sin^5 x = \frac{1}{16}\sin 5x - \frac{5}{16}\sin 3x + \frac58\sin x. \]



4.54 Prove that

\[ \sin\theta + \sin 2\theta + \sin 3\theta + \dots + \sin n\theta = \frac{\sin\frac{n+1}{2}\theta}{\sin\frac{\theta}{2}}\,\sin\frac{n\theta}{2}, \]
\[ \frac12 + \cos\theta + \cos 2\theta + \cos 3\theta + \dots + \cos n\theta = \frac{\sin\big(n + \frac12\big)\theta}{2\sin\frac{\theta}{2}}. \]

[Hint: Recall $e^{i\theta} = \cos\theta + i\sin\theta$ and de Moivre’s formula.]

4.55 ¶. Compute

v^, Vi, V2 - 2i, \Jl + iVs, ^8 - 8i, V^. 

4.56 ¶¶. Denote by $\epsilon_0, \dots, \epsilon_{n-1}$ the $n$-th roots of unity. Show that they form a multiplicative group $G_n$ of finite order $n$ (i.e., only the first $n - 1$ roots are distinct). We say that $\epsilon \in G_n$ is a generator of $G_n$ if the elements $1, \epsilon, \epsilon^2, \dots, \epsilon^{n-1}$ are distinct. Show that $\epsilon_h$ is a generator of $G_n$ if and only if $h$ and $n$ are coprime.

4.57 Show that the sum of the n-th roots of a number is zero. 



4.58 Solve the equations 



-h Siz -1-4 = 0, 
z'^ -\-2z + i = 0, 
zjzj -2z-l=0, 
jzj^z^ = i, 
z"^ zz = l-\- 2i, 



z^ = z, z^ +iz = 1, 

{z + if = {VS + if, 

z"^ = z^, z^=izz, 

^z^ = \z\^, zjzl - 2Uiz = 0, 

^z=llzl^. 



4.59 Show that $z$ and $cz$ are orthogonal in $\mathbb{R}^2 \simeq \mathbb{C}$ if and only if $c$ is purely imaginary.



4.60 ¶. Verify that

\[ \log(-5) = \log 5 + i\pi, \]
\[ \log(-\sqrt{3} + i) = \log 2 + i\,\frac{5}{6}\pi, \]
\[ \log(7 + 7i) = \log 7 + \frac12\log 2 + i\,\frac{\pi}{4}. \]

4.61 Interpret in complex notation the two-squares theorem of Diophantus of Alexandria (200-284)

\[ (u^2 + v^2)(x^2 + y^2) = (ux - vy)^2 + (uy + vx)^2. \]

4.62 ¶. Let $z_1, z_2, z_3, z_4$ be four points on a circle centered at the origin. Show that the following claims are equivalent:

(i) $z_1, z_2, z_3, z_4$ are the vertices of a rectangle,

(ii) $z_1 + z_2 + z_3 + z_4 = 0$,

(iii) $z_1, z_2, z_3, z_4$ are the roots of an equation of the type $(z^2 - a^2)(z^2 - b^2) = 0$ with $|a| = |b| \neq 0$.

4.63 ¶¶. ∘ $f : \mathbb{C} \to \mathbb{C}$ is said to be an isometry or distance preserving if $|f(w) - f(z)| = |z - w|$ $\forall z, w \in \mathbb{C}$. Show that $f$ is distance preserving if and only if $f(z) = f(0) + bz$ or $f(z) = f(0) + b\bar z$ with $|b| = 1$.

∘ $f : \mathbb{C} \to \mathbb{C}$ is said to be $\mathbb{R}$-linear if $f(z) = \Re z\, f(1) + \Im z\, f(i)$. Show that $f$ is $\mathbb{R}$-linear if and only if $f(z) = az + b\bar z$ with $a, b \in \mathbb{C}$.

∘ An $\mathbb{R}$-linear map $f : \mathbb{C} \to \mathbb{C}$ is said to be orthogonal if $\Re(f(z)\overline{f(w)}) = \Re(z\bar w)$ for all $z, w \in \mathbb{C}$. Show that $f$ is orthogonal if and only if $f(z) = az$ or $f(z) = a\bar z$ with $a \in \mathbb{C}$ and $|a| = 1$.

[Hint: For the first claim, consider $g(z) = (f(z) - f(0))/(f(1) - f(0))$, and show that $|g(z)|^2 = |z|^2$ and $|g(z) - 1|^2 = |z - 1|^2$.]

Abu al Khwarizmi (790-850)
Abu al Kindi (805-873)
Hunayn ibn-Ishaq (808-873)
Abu al Mahani (820-880)
ibn Qurra Abu’l Thabit (826-901)
Abu Shuja Kamil (850-930)
Abu’l al Uqlidisi (920-980)
Abu al-Battani (850-929)
Sinan-ibn-Thabit (880-943)
Abu Jafar al Khazin (900-971)
al-Buzjani Abu’l Wafa (940-998)
Abu al Quhi (940-1000)
al Karkhi (953-1029)
Abu Ali al Haytham (965-1039)
Abu al Biruni (973-1048)
ibn Sina Avicenna (980-1037)
ibn Tahir AlBaghdadi (980-1037)
Omar Khayyam (1048-1122)
Ibn al Samawal (1130-1180)
Nasir-al Tusi (1201-1274)
al Farisi Kamal (1260-1320)
Ghiyath al Kashi (1390-1450)
Ulugh-Beg (1393-1449)

Figure 4.12. The Arab renaissance.





Filippo Brunelleschi (1377-1446)
Leone Alberti (1404-1472)
della Francesca Piero (1412-1492)
Andrea Mantegna (1431-1506)
Leonardo da Vinci (1452-1519)
Albrecht Dürer (1471-1528)

Figure 4.13. Mathematics and art in the Renaissance period.





5. Polynomials, Rational 
Functions and Trigonometric 
Polynomials 



In this chapter we want to illustrate the relevance of complex numbers 
in some elementary situations. After a brief discussion of the algebra of 
polynomials in Section 5.1, we prove the fundamental theorem of algebra 
and discuss solutions by radicals of algebraic equations in Section 5.2. 
In Section 5.3 we present Hermite’s decomposition formulas for rational
functions, which are useful for the integration of rational functions, see 
Chapter 4 of [GMl]. Finally, in Section 5.4, we discuss some basic facts 
about trigonometric polynomials and, more generally, sums of sinusoidal 
signals. In particular we shall see that the spectrum of a signal completely 
identifies the signal itself, we shall prove the energy identity and present a 
sampling formula. 



5.1 Polynomials 

Let $K$ be a field as, for instance, $\mathbb{C}$, $\mathbb{R}$, $\mathbb{Q}$, or a finite field as, for example, the residue class $\mathbb{Z}_p$, $p$ prime. A polynomial with coefficients in $K$ in the indeterminate $x$ is an expression of the form

\[ P(x) := a_0 + a_1x + a_2x^2 + \dots + a_px^p = \sum_{j=0}^p a_jx^j,\ {}^1 \]

where $a_j \in K$ for all $j = 0, \dots, p$. The class of all polynomials with coefficients in $K$ in the indeterminate $x$ will be denoted by $K[x]$.

Presently a polynomial $P(x) \in K[x]$ is not a function defined in some domain, but, instead, a formal expression defined essentially by the list of its coefficients. In fact we say that two polynomials $P(x) = \sum_{j=0}^n a_jx^j$ and $Q(x) = \sum_{j=0}^m b_jx^j$ are equal if $a_j = b_j$ $\forall j = 0, \dots, \min(n, m)$, $a_j = 0$ $\forall j = \min(n, m) + 1, \dots, n$, and $b_j = 0$ $\forall j = \min(n, m) + 1, \dots, m$.

We can therefore extend, if this is convenient, the list of the coefficients of a polynomial by adding zeros as coefficients of higher order terms without changing the polynomial itself.

${}^1$ $a_0 + a_1x + a_2x^2 + \dots + a_px^p$ is the actual meaning of the sum!

Figure 5.1. The frontispieces of Ars Magna by Girolamo Cardano (1501-1576) and of Algebra by Rafael Bombelli (1526-1573).



If $P(x) = \sum_{j=0}^p a_jx^j \in K[x]$, the largest integer $j$ for which $a_j \neq 0$ is called the degree of $P$ and is denoted by $\deg P$. Nonzero constant polynomials have degree 0, and the zero-polynomial, that is the polynomial with $a_j = 0$ $\forall j$, is given degree $-\infty$.

Polynomials in $K[x]$ can be added and multiplied. If $P(x) = \sum_{j=0}^p a_jx^j$ and $Q(x) = \sum_{j=0}^q b_jx^j \in K[x]$, and assuming for instance $p \ge q$, we define the sum of $P$ and $Q$ by

\[ P(x) + Q(x) := \sum_{j=0}^p (a_j + b_j)x^j \]

where we have set $b_j = 0$ for $j = q + 1, \dots, p$. Of course $\deg(P + Q) \le \max(\deg P, \deg Q)$. The product of $P$ and $Q$ is then defined by

\[ P(x)Q(x) = \Big(\sum_{i=0}^p a_ix^i\Big)\Big(\sum_{j=0}^q b_jx^j\Big) := \sum_{i=0}^p\sum_{j=0}^q a_ib_jx^{i+j} = \sum_{k=0}^{p+q} c_kx^k \]

where

\[ c_k := \sum_{\substack{i, j \\ i + j = k}} a_ib_j \]

or, explicitly,

\[ \begin{aligned} c_0 &= a_0b_0, \\ c_1 &= a_1b_0 + a_0b_1, \\ c_2 &= a_2b_0 + a_1b_1 + a_0b_2, \\ &\ \ \vdots \\ c_n &= a_nb_0 + a_{n-1}b_1 + a_{n-2}b_2 + \dots + a_1b_{n-1} + a_0b_n. \end{aligned} \]

Figure 5.2. Frontispieces of the first and fifth parts of the General Trattato sui numeri of Niccolo Fontana (1500-1557), called Tartaglia.

Notice that we have extended the list of the coefficients of the polynomials by setting $a_j := 0$ for $j = p + 1, \dots, p + q$, and $b_j := 0$ for $j = q + 1, \dots, p + q$. It is easy to see that $\deg(PQ) = \deg P + \deg Q$.
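The product rule $c_k = \sum_{i+j=k} a_ib_j$ is a discrete convolution of the two coefficient lists. A minimal sketch follows; the function name `poly_mul` and the convention of storing coefficients lowest degree first are assumptions made for the illustration, and the empty list stands for the zero polynomial.

```python
def poly_mul(a, b):
    """Coefficients c_k = sum over i + j = k of a_i * b_j.

    a and b are coefficient lists, lowest degree first.
    """
    if not a or not b:
        return []
    c = [0] * (len(a) + len(b) - 1)   # degree of the product is deg a + deg b
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            c[i + j] += ai * bj
    return c

# (1 + x)^2 = 1 + 2x + x^2
square = poly_mul([1, 1], [1, 1])
# (1 + 2x + 3x^2)(4 + 5x) = 4 + 13x + 22x^2 + 15x^3
product = poly_mul([1, 2, 3], [4, 5])
```

The length of the result, `len(a) + len(b) - 1`, reflects the degree formula for a product when both leading coefficients are nonzero.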

5.1 ¶. Show that the product of two polynomials is zero if and only if one of the two polynomials is zero. This is expressed by saying that $K[x]$ is an integral domain.



5.1.1 The Division Algorithm 

Figure 5.3. The frontispiece of the Opere Matematiche by Paolo Ruffini (1765-1822) and the first page of the first chapter of the Teoria generale delle equazioni in cui si dimostra impossibile la soluzione algebrica delle equazioni di grado superiore al quarto by Paolo Ruffini (1765-1822).

Given two polynomials $A(x) = \sum_{j=0}^n a_jx^j$ and $B(x) = \sum_{j=0}^m b_jx^j$ with $\deg A = n$ and $\deg B = m \le n$, we observe that

\[ A_1(x) := A(x) - \frac{a_n}{b_m}x^{n-m}B(x) \in K[x] \]

has degree less than $n$. Proceeding inductively, it is not difficult to show that

5.2 Theorem (Division algorithm). For given $A, B \in K[x]$ with $B \neq 0$, there are uniquely defined polynomials $Q, R \in K[x]$ such that

\[ A = BQ + R \qquad \text{and} \qquad \deg R < \deg B. \]

The polynomial $Q$ in Theorem 5.2 is called the integral quotient of $A$ by $B$, and $R$ is called the remainder of $A$ divided by $B$.
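The inductive step described above, cancelling the leading term of $A$ with a suitable multiple of $B$, can be turned into a short routine. The sketch below is an illustration, not the authors' algorithm verbatim: the names `trim` and `poly_divmod` are hypothetical, coefficients are taken in $\mathbb{Q}$ via `fractions.Fraction` so that the arithmetic is exact, and lists store coefficients lowest degree first with the empty list as the zero polynomial.

```python
from fractions import Fraction

def trim(p):
    """Copy of p without trailing zero coefficients."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def poly_divmod(a, b):
    """Return (Q, R) with A = B*Q + R and deg R < deg B."""
    r, b = trim(a), trim(b)
    if not b:
        raise ZeroDivisionError("B must be a nonzero polynomial")
    q = [Fraction(0)] * max(len(r) - len(b) + 1, 1)
    while len(r) >= len(b):
        coef = Fraction(r[-1]) / b[-1]     # a_n / b_m
        shift = len(r) - len(b)            # n - m
        q[shift] = coef
        for j, bj in enumerate(b):         # r := r - coef * x^shift * B
            r[shift + j] -= coef * bj
        r = trim(r)                        # the leading term has been cancelled
    return trim(q), r

# x^3 - 2x^2 - 4 = (x - 3)(x^2 + x + 3) + 5
q, r = poly_divmod([-4, 0, -2, 1], [-3, 1])
```

Each pass of the loop lowers the degree of the running remainder by at least one, which is why the procedure terminates.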



a. Euclid’s algorithm and Bezout identity 

Let $A, B \in K[x]$ be two nonzero polynomials. We say that $B$ divides $A$ or that $B$ is a divisor of $A$ if $A = BQ$ for some $Q \in K[x]$. Notice that, if $B$ is a divisor of $A$, then $\lambda B$ for $\lambda \in K$, $\lambda \neq 0$, is a divisor of $A$, too. Notice that this contrasts with the notion of integral divisor of an integer. We shall say that $A$ is irreducible in $K$ if it has no divisors other than the nonzero constants and the constant multiples of $A$. In this case if $A = BQ$, then either $B$ or $Q$ reduces to a constant polynomial.

We now look for the common divisors of $A$ and $B$. Clearly polynomials of degree zero are common divisors of $A$ and $B$; also, every common divisor of $A$ and $B$ divides $PA + QB$ for all polynomials $P, Q$.

We say that a subset $\mathcal{I} \subset K[x]$ is an ideal of $K[x]$ if $\mathcal{I}$ is a subgroup with respect to the addition, and it is closed with respect to the multiplication with any element, that is, $PQ \in \mathcal{I}$ $\forall P \in K[x]$, $\forall Q \in \mathcal{I}$.

5.3 Proposition. Every nonzero ideal $\mathcal{I}$ of polynomials in one indeterminate is a principal ideal, that is, it contains only multiples (in $K[x]$) of a polynomial that is unique up to a multiplication by a constant.

Proof. Let $B$ be a nonzero polynomial in $\mathcal{I}$ with minimal degree and let $A$ be any other element in $\mathcal{I}$. By dividing $A$ by $B$ we have $A = BQ + R$, thus $R = A - BQ$ belongs to $\mathcal{I}$. Since $\deg R < \deg B$, we conclude that $R = 0$, i.e., $A = BQ$. Suppose now that $B' \in \mathcal{I}$ and $\deg B' = \deg B$; $B' = BQ$ yields $\deg Q = 0$, i.e., $B' = \lambda B$, $\lambda \in K$. □



Since

\[ \mathcal{I} := \{ PA + QB \mid P, Q \in K[x] \} \]

is an ideal of $K[x]$, we conclude that there exists a polynomial $D$, uniquely defined modulo a multiplicative constant, such that every polynomial $PA + QB$ in $\mathcal{I}$ is a multiple of $D$. In particular

∘ $D = AP + BQ$ for some $P, Q$,

∘ $A = \tilde AD$ and $B = \tilde BD$ for some $\tilde A, \tilde B \in K[x]$, since $1 \cdot A + 0 \cdot B,\ 0 \cdot A + 1 \cdot B \in \mathcal{I}$.

We therefore conclude that every common divisor of $A$ and $B$ divides $D$ and that every divisor of $D$ divides both $A$ and $B$. That is, $D$ is the (up to a multiplicative constant) greatest common divisor of $A$ and $B$. With some abuse of notation, it is denoted by g.c.d.$(A, B)$.

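As for integers, Euclid’s algorithm and Euclid’s generalized algorithm yield a way to compute the greatest common divisor of $A$ and $B$ together with polynomials $U, V$ such that $AU + BV = \text{g.c.d.}(A, B)$. Assume $\deg A \ge \deg B$ and define the three sequences $\{R_k\}$, $\{U_k\}$, $\{V_k\}$ by

\[ \begin{cases} R_0 := A, \quad R_1 := B, \\ R_{k+1} := R_{k-1} - Q_kR_k, \end{cases} \qquad \begin{cases} U_0 := 1, \quad U_1 := 0, \\ U_{k+1} := U_{k-1} - Q_kU_k, \end{cases} \qquad \begin{cases} V_0 := 0, \quad V_1 := 1, \\ V_{k+1} := V_{k-1} - Q_kV_k, \end{cases} \]

where $Q_k$ is the integral quotient of $R_{k-1}$ by $R_k$, until $R_{n+1} = 0$.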

5.4 Theorem (Euclid). We have Rn := g.c.d. {A, B), and 
g.c.d. {A, B) = Rn = AUn-\- B Vn- 

Proof. (i) In fact, noticing that $C$ divides $A$ and $B$ if and only if $C$ divides $B$
and the remainder $R := A - BQ$, by induction one proves

$$\text{g.c.d.}(A, B) = \text{g.c.d.}(R_0, R_1) = \text{g.c.d.}(R_1, R_2) = \dots = \text{g.c.d.}(R_{n-1}, R_n) = R_n.$$

(ii) It suffices to check by induction that $R_k = AU_k + BV_k$, $k = 0, \dots, n$.

□ 
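The recursion for $R_k$, $U_k$, $V_k$ can be tried out on a computer. The following is only a minimal sketch, assuming polynomials with rational coefficients stored as Python lists with the highest degree first and the zero polynomial stored as the empty list; the function names (`trim`, `poly_divmod`, `poly_sub_mul`, `extended_euclid`) are illustrative choices, not notation from the text.

```python
from fractions import Fraction

def trim(p):
    """Drop leading zero coefficients (highest degree first)."""
    i = 0
    while i < len(p) and p[i] == 0:
        i += 1
    return p[i:]

def poly_divmod(a, b):
    """Quotient and remainder of a divided by b (b must be nonzero)."""
    a = trim([Fraction(c) for c in a])
    b = trim([Fraction(c) for c in b])
    q = []
    while len(a) >= len(b):
        factor = a[0] / b[0]
        q.append(factor)
        for i in range(len(b)):
            a[i] -= factor * b[i]
        a.pop(0)  # the leading coefficient has just been cancelled
    return q, trim(a)

def poly_sub_mul(p, q, r):
    """Return the polynomial p - q * r."""
    prod = [Fraction(0)] * (len(q) + len(r) - 1) if q and r else []
    for i, x in enumerate(q):
        for j, y in enumerate(r):
            prod[i + j] += x * y
    n = max(len(p), len(prod))
    p = [Fraction(0)] * (n - len(p)) + list(p)
    prod = [Fraction(0)] * (n - len(prod)) + prod
    return trim([x - y for x, y in zip(p, prod)])

def extended_euclid(a, b):
    """Return (R_n, U_n, V_n) with R_n = a * U_n + b * V_n."""
    r_prev, r = trim([Fraction(c) for c in a]), trim([Fraction(c) for c in b])
    u_prev, u = [Fraction(1)], []      # U_0 = 1, U_1 = 0
    v_prev, v = [], [Fraction(1)]      # V_0 = 0, V_1 = 1
    while r:
        q, rem = poly_divmod(r_prev, r)
        r_prev, r = r, rem             # R_{k+1} = R_{k-1} - Q_k R_k
        u_prev, u = u, poly_sub_mul(u_prev, q, u)
        v_prev, v = v, poly_sub_mul(v_prev, q, v)
    return r_prev, u_prev, v_prev

# A = x^2 - 1 and B = x^2 - 3x + 2 share the factor x - 1
g, u, v = extended_euclid([1, 0, -1], [1, -3, 2])
```

For this pair the sketch returns the list representing $3x - 3$, that is, the greatest common divisor $x - 1$ up to a multiplicative constant, together with $U = 1$ and $V = -1$.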



b. Factorization 

Two polynomials that have a constant polynomial as greatest common 
divisor are said to be coprime, and a polynomial is said to be prime or 
irreducible over K if it has no divisors except for the nonzero constants. 

From Euclid's algorithm, it is easy to see that g.c.d. $(AC, BC) = $
g.c.d. $(A, B)\, C$. This implies the following.

5.5 Theorem (Euclid). If A divides BC and A and B are coprime, then 
A divides C. 

This, in turn, allows us to prove as for integers the following. 

5.6 Theorem (Unique factorization). Every polynomial in K[x] can 
be uniquely written as a product of irreducible factors. 

Thus irreducible factors play in K[x] the same role as prime numbers 
in arithmetic. 

5.7 Remark. We notice that the notion of irreducible polynomial depends
on the field $\mathbb{K}$ of coefficients. For instance, $x^2 - 4$ is not prime in
$\mathbb{Q}[x]$, $x^2 - 2$ is prime in $\mathbb{Q}[x]$ but not in $\mathbb{R}[x]$, nor in $\mathbb{C}[x]$, and $x^2 + 1$ is
prime in $\mathbb{R}[x]$ but not in $\mathbb{C}[x]$.

c. The factor theorem 

A polynomial $P(x) = \sum_{j=0}^n a_j x^j$ can be regarded as a function
$P : \mathbb{K} \to \mathbb{K}$ which maps $z \in \mathbb{K}$ to $P(z) := \sum_{j=0}^n a_j z^j$, which we call the
polynomial function of $P$. In general, two different polynomials may have
the same polynomial function, for instance $x + 1$ and $x^2 + 1$ on $\mathbb{Z}_2$, but,
as we shall see in a moment, two polynomials are identical if and only if 
they have the same polynomial function, provided the field K is infinite. 

We say that $\alpha \in \mathbb{K}$ is a zero of a polynomial $P \in \mathbb{K}[x]$ if $P(\alpha) = 0$.
From the division algorithm theorem we infer at once

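5.8 Theorem (Ruffini). Let $P(x) = \sum_{j=0}^p a_j x^j$ be a polynomial
of degree $p$ and let $\alpha \in \mathbb{K}$. Then $P(x) = (x - \alpha)Q_\alpha(x) + P(\alpha)$ $\forall x \in \mathbb{K}$
where $Q_\alpha(x) = \sum_{j=0}^{p-1} b_j x^j$ and

$$b_{p-1} = a_p, \qquad b_{j-1} = a_j + \alpha\, b_j \quad \forall j = p-1, p-2, \dots, 1.$$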
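Proof. In fact,

$$(x - \alpha)Q_\alpha(x) = \sum_{j=1}^{p} b_{j-1} x^j - \alpha \sum_{j=0}^{p-1} b_j x^j = b_{p-1}x^p + \sum_{j=1}^{p-1} (b_{j-1} - \alpha b_j)x^j - \alpha b_0$$

$$= a_p x^p + \sum_{j=1}^{p-1} a_j x^j - \alpha b_0 = P(x) - (a_0 + \alpha b_0).$$

Setting $x = \alpha$ we find $a_0 + \alpha b_0 = P(\alpha)$.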



□ 
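Ruffini's rule is a one-pass computation, and it may help to see it written out. The sketch below assumes the coefficients are given as the list $[a_p, \dots, a_1, a_0]$; the name `ruffini` is only an illustrative choice.

```python
def ruffini(coeffs, alpha):
    """Divide P by (x - alpha).

    coeffs is [a_p, ..., a_1, a_0]. Returns the pair
    ([b_{p-1}, ..., b_0], P(alpha)), following
    b_{p-1} = a_p and b_{j-1} = a_j + alpha * b_j.
    """
    b = []
    carry = 0
    for a in coeffs:
        carry = a + alpha * carry
        b.append(carry)
    # the last value computed is a_0 + alpha * b_0 = P(alpha)
    return b[:-1], b[-1]

# P(x) = x^3 - 7x + 6 divided by (x - 2): quotient x^2 + 2x - 3, remainder 0
print(ruffini([1, 0, -7, 6], 2))  # ([1, 2, -3], 0)
# the same polynomial divided by (x - 3): remainder P(3) = 12
print(ruffini([1, 0, -7, 6], 3))  # ([1, 3, 2], 12)
```

The remainder returned is the value $P(\alpha)$, so the same loop also evaluates the polynomial with $p$ multiplications.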



Theorem 5.8 yields the following. 

5.9 Theorem (Factor theorem). Let $P \in \mathbb{K}[x]$ and $\alpha \in \mathbb{K}$. Then $x - \alpha$
divides $P(x)$ if and only if $\alpha$ is a root of $P$, i.e., $P(\alpha) = 0$.

If we know $k$ distinct roots $x_1, x_2, \dots, x_k$ of $P$, we can write inductively
$P(x) = (x - x_1)Q_1(x)$, $P(x) = (x - x_1)(x - x_2)Q_2(x)$ as $Q_1(x_2) = 0$, ...,
and finally

$$P(x) = (x - x_1)(x - x_2) \cdots (x - x_k)Q_k(x), \tag{5.1}$$

where $\deg Q_k = \deg P - k$. In particular we cannot exceed $\deg P$, that is,
every polynomial of degree $n$ has at most $n$ roots.

5.10 Theorem (Principle of identity of polynomials). Two polynomials
$P$ and $Q \in \mathbb{K}[x]$ of degree at most $n$ are equal in $\mathbb{K}[x]$ if and only
if their polynomial functions take equal values in at least $n + 1$ distinct
points of $\mathbb{K}$. In particular, if $P$ has degree at most $n$ and its polynomial
function vanishes in at least $n + 1$ distinct points, then $P = 0$ in $\mathbb{K}[x]$.

5.11 $P(x)$ in the indeterminate $x - \alpha$. By using the factor theorem,
it is easy to rewrite $P \in \mathbb{K}[x]$ as a polynomial in the indeterminate $x - \alpha$.
We have the following.

Proposition. Let $P$ be a polynomial of degree $p$ and let $\alpha \in \mathbb{K}$. Let
$Q_p, Q_{p-1}, \dots, Q_1, Q_0$ be the polynomials of degrees respectively $p, p-1, \dots, 1, 0$
obtained iteratively by Ruffini's rule, i.e.,

$$Q_p := P, \qquad Q_j(z) - Q_j(\alpha) = (z - \alpha)Q_{j-1}(z) \quad \forall j = p, \dots, 2, 1.$$

Then $P(x) = P(\alpha) + \sum_{j=1}^{p} Q_{p-j}(\alpha)(x - \alpha)^j$.

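Proof. In fact,

$$Q_0(x) = Q_0(\alpha),$$

$$Q_1(x) = Q_1(\alpha) + (x - \alpha)Q_0(x),$$

$$Q_2(x) = Q_2(\alpha) + (x - \alpha)Q_1(x) = Q_2(\alpha) + (x - \alpha)Q_1(\alpha) + (x - \alpha)^2 Q_0(\alpha),$$

$$\dots$$

$$P(x) = Q_p(x) = \sum_{j=0}^{p} Q_{p-j}(\alpha)(x - \alpha)^j.$$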



□ 
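The proposition amounts to applying Ruffini's rule repeatedly: each division by $x - \alpha$ peels off one coefficient of the expansion. A minimal sketch follows, assuming coefficients listed from the highest degree down; `ruffini` and `expand_about` are hypothetical helper names.

```python
def ruffini(coeffs, alpha):
    """Return (quotient coefficients, remainder) of P divided by (x - alpha)."""
    b = []
    carry = 0
    for a in coeffs:
        carry = a + alpha * carry
        b.append(carry)
    return b[:-1], b[-1]

def expand_about(coeffs, alpha):
    """Return [c_0, ..., c_p] with P(x) = sum_j c_j (x - alpha)^j.

    The j-th remainder is Q_{p-j}(alpha) in the notation of the proposition.
    """
    result = []
    q = list(coeffs)
    while q:
        q, value = ruffini(q, alpha)
        result.append(value)
    return result

# x^3 - 7x + 6 = -4 (x - 1) + 3 (x - 1)^2 + (x - 1)^3
print(expand_about([1, 0, -7, 6], 1))  # [0, -4, 3, 1]
```

The output lists the coefficients in increasing powers of $x - \alpha$, so the first entry is $P(\alpha)$.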
Figure 5.4. The Italian Renaissance is probably the turning point for the development
of modern western culture. The knowledge of artists, technicians and merchants merges 
with school education. Luca Pacioli (1445-1517) writes for “curious and ingenious en- 
gineers and for any scholar of philosophy, perspective, painting, sculpture, architecture, 
music and other mathematics.” 



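5.12 Complex derivative. Let $P(z) = \sum_{j=0}^{n} a_j (z - \alpha)^j$, $\alpha \in \mathbb{C}$. The
complex derivative of $P$ is defined as the polynomial of degree $n - 1$,

$$DP(z) = P'(z) := \sum_{j=1}^{n} j\, a_j (z - \alpha)^{j-1}.$$

Of course $D^2P(z) := D(DP)(z) = \sum_{j=2}^{n} j(j-1)\, a_j (z - \alpha)^{j-2}$ and

$$D^kP(z) = \begin{cases} \sum_{j=k}^{n} j(j-1)\cdots(j-k+1)\, a_j (z - \alpha)^{j-k} & \text{if } k \le n, \\ 0 & \text{if } k > n. \end{cases}$$

In particular for $0 \le k \le n$, $D^kP(\alpha) = k!\, a_k$. Consequently we infer,
compare Taylor's formula in [GM1],

$$P(z) = \sum_{j=0}^{n} \frac{D^jP(\alpha)}{j!} (z - \alpha)^j. \tag{5.2}$$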
Figure 5.5. The famous architect Leon
Battista Alberti (1404-1472) was the 
theorist of mathematical perspective. 
His ideas were presented in De Pictura, 
1511, while in Ludi Mathematici he dis- 
cussed applications of mathematics to 
various practical problems. The fron- 
tispiece of his De re aedificatoria, or- 
ganic summa of the architecture of his 
time. 
5.1.2 The fundamental theorem of algebra

Finding the irreducible polynomials in K[x] is obviously a key point of the 
algebra of polynomials. We shall restrict ourselves to factorization in the 
fields R and C. 



a. Factorization in C 

Let $P(z) = \sum_{k=0}^{n} a_k z^k$ be a polynomial in $\mathbb{C}[z]$ of degree $n$, i.e., $a_n \neq 0$.
Trivially every root of $P(z)$, that is a point $\xi \in \mathbb{C}$ such that $P(\xi) = 0$, is a
minimum point for the real-valued function $z \to |P(z)|$. An interesting fact
discovered by Jean d'Alembert (1717-1783) and Jean Argand (1768-1822)
is that the converse holds true.

5.13 Proposition. Let $\xi$ be a local minimizer for $|P(z)|$, i.e., there is a
disk $B(\xi, \rho)$ of radius $\rho$ and center $\xi$ such that $|P(\xi)| \le |P(z)|$ $\forall z \in B(\xi, \rho)$;
then $|P(\xi)| = 0$.

This is a consequence of the following. 

5.14 Lemma (d'Alembert). Let $z_0 \in \mathbb{C}$ be such that $P(z_0) \neq 0$; then
for any $\varepsilon > 0$ we can find $h \in \mathbb{C}$ with $|h| < \varepsilon$ such that $|P(z_0 + h)| < |P(z_0)|$.

In order to prove d’Alembert’s lemma we make the following remarks. 
o Let $P(z) = \sum_{j=0}^{p} a_j z^j$ and let $\alpha \in \mathbb{C}$. As we have seen in 5.11, we can
write $P(z)$ as

$$P(z) - P(\alpha) = \sum_{j=1}^{p} A_j (z - \alpha)^j$$

where the coefficients $A_j$ depend on the coefficients of $P$ and $\alpha$ but
not on $z$. Denote by $m$ the smallest integer such that $A_m \neq 0$. Clearly
$1 \le m \le p$ and

$$P(z) - P(\alpha) = \sum_{j=m}^{p} A_j (z - \alpha)^j. \tag{5.3}$$

o From (5.3) and the triangle inequality, we infer

$$|P(z) - P(\alpha)| = \Big| \sum_{j=m}^{p} A_j (z - \alpha)^j \Big| \le \sum_{j=m}^{p} |A_j|\, |z - \alpha|^j,$$

therefore for all $z$ with $|z - \alpha| < 1$ we have

$$|P(z) - P(\alpha)| \le k\, |z - \alpha|^m, \qquad k := \sum_{j=m}^{p} |A_j|. \tag{5.4}$$

Proof of Lemma 5.14. According to the above, for $h \in \mathbb{C}$ we write

$$P(z_0 + h) = P(z_0) + \sum_{j=1}^{n} A_j h^j \tag{5.5}$$

and, if $m$ is the smallest integer with $A_m \neq 0$ and

$$Q(h) := \sum_{j=m+1}^{n} A_j h^j,$$

we rewrite (5.5) as

$$P(z_0 + h) = P(z_0) + A_m h^m + Q(h).$$

We now choose $h_0$ in such a way that $A_m h_0^m$ is in the opposite direction
of $P(z_0)$,

$$A_m h_0^m = -P(z_0), \tag{5.6}$$

i.e., we choose $h_0$ as one of the $m$-th roots of $-P(z_0)/A_m$, which is possible,
since we are working in $\mathbb{C}$. Then we set $h = \rho h_0$, $\rho$ small and precisely
$\rho < 1$ and $\rho|h_0| = |h| < 1$. From (5.4), applied to $Q$, and (5.6) we then infer

$$|Q(h)| \le k|h|^{m+1} = k|h_0|^{m+1}\rho^{m+1} = \frac{k|h_0|}{|A_m|}\, \rho^{m+1} |P(z_0)|,$$

hence
Figure 5.6. Jean d'Alembert (1717-1783) and Carl Friedrich Gauss (1777-1855).

$$|Q(h)| \le \frac{1}{2}\rho^m |P(z_0)|,$$

if, moreover, $\rho < |A_m|/(2k|h_0|)$. Finally, from (5.5) and the triangle
inequality we conclude

$$|P(z_0 + h)| \le (1 - \rho^m)|P(z_0)| + |Q(h)| \le \Big(1 - \rho^m + \frac{1}{2}\rho^m\Big)|P(z_0)| < |P(z_0)|.$$

□ 

Besides Proposition 5.13 we also have 

5.15 Lemma (Coercivity). Let $P(z) = \sum_{j=0}^{n} a_j z^j$ be a polynomial of
degree $n \ge 1$. Then

$$\lim_{|z| \to \infty} |P(z)| = +\infty.$$

Proof. Factoring out the term of highest degree, we have

$$P(z) = a_n z^n (1 + Q(1/z)), \qquad Q(w) := \sum_{j=1}^{n} \frac{a_{n-j}}{a_n} w^j.$$

If $k := \sum_{j=1}^{n} |a_{n-j}/a_n|$, $|z| > 1$ and $|z| > 2k$, applying (5.4) to $Q$ we get

$$|Q(1/z)| = |Q(1/z) - Q(0)| \le \frac{k}{|z|} < \frac{1}{2}.$$

Consequently, using the triangle inequality $|1 + q| \ge |1 - |q||$,

$$|P(z)| = |a_n||z|^n\, |1 + Q(1/z)| \ge |a_n||z|^n\, \big|1 - |Q(1/z)|\big| = |a_n||z|^n (1 - |Q(1/z)|) \ge \frac{1}{2}|a_n||z|^n;$$

this yields the result at once.



□ 






Figure 5.7. The first pages of two of the papers dedicated by Carl Friedrich Gauss 
(1777-1855) to the fundamental theorem of algebra. 



On account of coercivity, d’Alembert thought that the existence of a 
minimum point for the function \P{z)\ was evident and concluded 

5.16 Theorem. Every nonconstant polynomial with complex coefficients 
has at least a complex root. 

However, existence of a minimizer of $|P(z)|$ is not at all evident and
trivial. It is a consequence of the continuity of polynomial functions and of
the continuity or completeness of $\mathbb{C}$. In fact, from Weierstrass's theorem,
Theorem 5.16, and the factor theorem, Theorem 5.9, we readily conclude

5.17 Theorem (Fundamental theorem of algebra). Every complex
polynomial of degree $n \ge 1$ factorizes as a product of $n$ polynomials of
first degree,

$$P(z) = a_n (z - \alpha_1)(z - \alpha_2) \cdots (z - \alpha_n).$$



b. Simple and multiple roots of a polynomial 

5.18 Definition. Let $P \in \mathbb{K}[x]$, $\alpha \in \mathbb{K}$, and let $k \ge 1$. We say that $\alpha$
is a root of $P$ of multiplicity $k$, $1 \le k \le n := \deg P$, if $(x - \alpha)^k$ divides $P(x)$ and
$(x - \alpha)^{k+1}$ does not divide $P$. A simple root of $P$ is a root of multiplicity 1.

Notice that, from the factor theorem, $\alpha$ is a root of multiplicity $k$ if
and only if

$$P(x) = (x - \alpha)^k Q(x), \qquad Q(\alpha) \neq 0.$$

Let $z_1, \dots, z_k$ be the roots of $P$ and assume that they have multiplicities
respectively $n_1, \dots, n_k$. Then the fundamental theorem is rewritten as:
every polynomial $P \in \mathbb{C}[z]$ of degree $n$ has exactly $n$ roots, when counted
with their multiplicities, i.e.,

$$P(z) = a_n (z - z_1)^{n_1}(z - z_2)^{n_2} \cdots (z - z_k)^{n_k}, \qquad n_1 + n_2 + \dots + n_k = n.$$

The roots of a polynomial and of its derivative are related. It is easy 

to prove the following: 

o A simple root of a polynomial P is not a root of its derivative P'; 
consequently P and P' are coprime if and only if all the roots of P 
are simple. 

o A root of multiplicity $k$ of a polynomial is a root of its derivative of
multiplicity $k - 1$.

o Let $P$ be a polynomial. Then $P_0 := P/$g.c.d. $(P, P')$ has the same set of
zeros as $P$ but all its roots are simple.

The following claims are easy consequences of Rolle’s theorem: 

o If all roots of a real polynomial are real, then all roots of its derivative 
are also real. 

o If all roots of a real polynomial P are real and of those p are positive, 
then P' has p or p — 1 positive roots. 



c. Factorization in R 



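If $P(z) = \sum_{k=0}^{n} a_k z^k$, its conjugate polynomial is defined by
$\overline{P}(z) := \sum_{k=0}^{n} \overline{a_k}\, z^k$. Of course

$$\overline{P(z)} = \sum_{k=0}^{n} \overline{a_k}\, \overline{z}^k = \overline{P}(\overline{z}).$$

It follows: $\alpha$ is a root of $P$ with multiplicity $h$ if and only if $\overline{\alpha}$ is a root for
$\overline{P}$ of multiplicity $h$. Since $\overline{P} = P$ for polynomials with real coefficients, we
deduce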

5.19 Proposition. Every real polynomial of degree $n$ has $n$ complex roots when
counted with their multiplicities; an even number of them are nonreal
and come in couples of conjugate complex numbers.

As a corollary, on account of the fundamental theorem of algebra, we 
have proved again that every real polynomial with odd degree has at least 
one real root. 

Let $P$ be a polynomial of degree $n$ with real coefficients and let
$\alpha_1, \alpha_2, \dots, \alpha_p$ be its real roots with respective multiplicities $k_1, k_2, \dots, k_p$.
Moreover, let $\beta_1, \beta_2, \dots, \beta_q$ be its complex roots with positive imaginary
parts and multiplicities $h_1, h_2, \dots, h_q$. Since also $\overline{\beta_1}, \overline{\beta_2}, \dots, \overline{\beta_q}$ are roots
of $P$ with multiplicities $h_1, h_2, \dots, h_q$, we find, on account of the fundamental
theorem of algebra, that $k_1 + \dots + k_p + 2h_1 + \dots + 2h_q = n$,

$$P(z) = a_n \prod_{j=1}^{p} (z - \alpha_j)^{k_j} \prod_{j=1}^{q} (z - \beta_j)^{h_j}(z - \overline{\beta_j})^{h_j},$$

and, writing $\beta_j =: b_j + i c_j$ so that

$$(x - \beta_j)(x - \overline{\beta_j}) = (x - b_j)^2 + c_j^2,$$

we conclude

$$P(x) = a_n \prod_{j=1}^{p} (x - \alpha_j)^{k_j} \prod_{j=1}^{q} \big((x - b_j)^2 + c_j^2\big)^{h_j} \qquad \forall x \in \mathbb{R}.$$

We therefore have the following. 

5.20 Proposition. Every real polynomial can be factorized as a product 
of first and second order irreducible polynomials in R. 



5.2 Solutions of Polynomial Equations 

The Italian Renaissance marks a tremendous renewal of interest in nature, 
and also in mathematics. Artists studied and employed mathematics in- 
tensively, among them let us mention Filippo Brunelleschi (1377-1446), 
Paolo Uccello (1397-1475), Masaccio (1401-1428), Leon Battista Alberti 
(1404-1472), and Piero della Francesca (1410-1492), who set forth the 
mathematical principle of perspective. The development of banking and 
commercial activities called for an improved arithmetic. The Summa by 
Luca Pacioli (1445-1517) and the General trattato dei numeri e misure by 
Niccolo Fontana (1500-1557), called Tartaglia, contained many problems 
on what one could call numerable mathematics. 

With respect to the topic we are discussing, the new flourishing of 
mathematical studies led to the discovery of formulas for solving algebraic
equations of degree 3 and 4 and the consequent introduction of the
imaginary unit. These developments are connected to the names of Scipione
del Ferro (1465-1526), Niccolo Fontana (1500-1557), called Tartaglia,
Girolamo Cardano (1501-1576) and Rafael Bombelli (1526-1573) and are 
presented in Cardano’s Ars Magna and Bombelli’s Algebra. 



5.2.1 Solutions by radicals 

Equations of second degree

$$az^2 + bz + c = 0, \qquad a, b, c \in \mathbb{C},\ a \neq 0, \tag{5.7}$$

can be easily solved in $\mathbb{C}$ as everybody knows. In fact, completing the
square we get

$$a\Big(z + \frac{b}{2a}\Big)^2 + c - \frac{b^2}{4a} = 0,$$

hence

$$\Big(2a\Big(z + \frac{b}{2a}\Big)\Big)^2 + 4ac - b^2 = 0.$$

Thus solutions are given by

$$z = \frac{-b \pm w}{2a}$$

where $w$ and $-w$ are the roots of $w^2 = b^2 - 4ac$.

Figure 5.8. The first page of Traité des substitutions et des équations algébriques by
Camille Jordan (1838-1922) and of Teoria dei gruppi di sostituzioni by Luigi Bianchi
(1856-1928).

5.21 Third degree equations. Complex numbers were introduced as
an intermediate step to solve the third degree equation

$$x^3 - 3px - 2q = 0, \qquad p, q \in \mathbb{R},\ p \neq 0, \tag{5.8}$$

that, as we know, has at least a real solution. Set $x = u + v$, $p = uv$, so
that (5.8) becomes

$$u^3 + v^3 - 2q = 0 \tag{5.9}$$

hence

$$u^6 - 2qu^3 + p^3 = 0,$$

since $v = p/u$. The last equation is solved by

$$u^3 = q + \sqrt{q^2 - p^3},$$

while (5.9) yields

$$v^3 = q - \sqrt{q^2 - p^3}.$$

Thus

$$x = u + v = \sqrt[3]{q + \sqrt{q^2 - p^3}} + \sqrt[3]{q - \sqrt{q^2 - p^3}}$$

is a solution of (5.8). This is of course correct if $q^2 - p^3 \ge 0$, otherwise
the solving formula is meaningless (at least for the Renaissance people).
However, if we introduce the imaginary unit $i := \sqrt{-1}$ in the case $q^2 - p^3 < 0$,
we have

$$u^3 = q + i\sqrt{p^3 - q^2}, \qquad v^3 = q - i\sqrt{p^3 - q^2}. \tag{5.10}$$

If $u = a + ib$ is a cubic root of $q + i\sqrt{p^3 - q^2}$, we see from (5.10) that
$v := a - ib$ is a cubic root of $q - i\sqrt{p^3 - q^2}$, therefore the imaginary parts
cancel if we sum $u + v$, finding a real root.



5.22 Example. If we consider the equation

$$x^3 - 15x - 4 = 0, \qquad p = 5,\ q = 2, \tag{5.11}$$

we find $x = \sqrt[3]{2 + 11i} + \sqrt[3]{2 - 11i}$. If we try to express $2 + 11i$ as the cube of a complex
number, we find $(2 + i)^3 = 8 + 12i + 6i^2 + i^3 = 2 + 11i$, while $(2 - i)^3 = 2 - 11i$, hence
$x = 2 + i + 2 - i = 4$ is a solution of (5.11).

An adjustment of the method just presented allows us to solve third
degree equations. We want to solve in $\mathbb{C}$,

$$P(z) = a_0 z^3 + a_1 z^2 + a_2 z + a_3 = 0, \qquad a_0, a_1, a_2, a_3 \in \mathbb{C},\ a_0 \neq 0. \tag{5.12}$$

We see that $P''(\alpha) = 0$, if $\alpha := -a_1/(3a_0)$, hence

$$P(z) = a_0 (z - \alpha)^3 + P'(\alpha)(z - \alpha) + P(\alpha),$$

and $z$ solves (5.12) if and only if $y := z - \alpha$ solves $a_0 y^3 + P'(\alpha) y + P(\alpha) = 0$.
Therefore it suffices to solve equations of the form

$$z^3 + pz + q = 0, \qquad p, q \in \mathbb{C}. \tag{5.13}$$

The idea is to look for solutions of the form $z = u + v$. Inserting $z = u + v$
in (5.13), we see that $z$ is a solution if $u$ and $v$ satisfy

$$uv = -p/3, \qquad u^3 + v^3 = -q,$$

or

$$u^3 v^3 = -p^3/27, \qquad u^3 + v^3 = -q.$$
Figure 5.9. Evariste Galois (1811-1832)
and the frontispiece of the Teoria geo- 
metrica delle equazioni by Federigo En- 
riques (1871-1946). 

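This happens if $u^3$ and $v^3$ are the two solutions $r_1, r_2$ of the second degree
equation

$$r^2 + qr - p^3/27 = 0.$$

The numbers $u$ and $v$ are then to be chosen among the cubic roots of $r_1$
and $r_2$. Set

$$u_0 := |r_1|^{1/3} e^{i\theta_1/3}, \qquad v_0 := |r_2|^{1/3} e^{i\theta_2/3}, \qquad \omega := -1/2 + i\sqrt{3}/2,$$

where $\theta_1$ and $\theta_2$ are arguments of $r_1$ and $r_2$, chosen so that $u_0 v_0 = -p/3$.
Then the solutions of (5.13) are among the numbers

$$z = u_0\omega^i + v_0\omega^j, \qquad i, j = 0, 1, 2.$$

Since $u_0\omega^i v_0\omega^j = u_0 v_0 \omega^{i+j} = -p/3$ only if $i + j$ is a multiple of 3, we conclude that

$$z_i = u_0\omega^i + v_0\omega^{3-i}, \qquad i = 0, 1, 2,$$

are the three solutions of equation (5.13).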
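The procedure can be checked numerically. The sketch below is one possible way to carry it out in floating-point complex arithmetic, not a robust solver: instead of choosing arguments for both cube roots, it picks any cube root $u_0$ of $r_1$ and sets $v_0 := -p/(3u_0)$, which enforces $u_0 v_0 = -p/3$ directly. The function name `solve_depressed_cubic` is a hypothetical choice.

```python
import cmath

def solve_depressed_cubic(p, q):
    """Approximate roots of z^3 + p z + q = 0 via z = u + v with u v = -p/3."""
    # r1, r2 solve r^2 + q r - p^3/27 = 0
    disc = cmath.sqrt(q * q / 4 + p ** 3 / 27)
    r1 = -q / 2 + disc
    r2 = -q / 2 - disc
    if abs(r1) < abs(r2):
        r1 = r2                      # use the root of larger modulus
    if r1 == 0:
        return [0j, 0j, 0j]          # then p = q = 0 and z^3 = 0
    u0 = complex(r1) ** (1 / 3)      # one cube root of r1
    v0 = -p / (3 * u0)               # forces u0 * v0 = -p/3
    omega = complex(-0.5, 3 ** 0.5 / 2)
    # z_k = u0 * omega^k + v0 * omega^(3 - k), and omega^(3 - k) = omega^(-k)
    return [u0 * omega ** k + v0 * omega ** (-k) for k in range(3)]

# Example 5.22: x^3 - 15 x - 4 = 0, i.e. p = -15, q = -4 in the form (5.13)
for z in solve_depressed_cubic(-15, -4):
    print(z, abs(z ** 3 - 15 * z - 4))
```

Up to rounding errors, one of the printed roots is 4, in agreement with Example 5.22, and each printed residual is close to zero.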

5.23 Fourth degree equations. Suppose we want to solve

$$P(z) = a_0 z^4 + a_1 z^3 + a_2 z^2 + a_3 z + a_4 = 0, \qquad a_0, \dots, a_4 \in \mathbb{C},\ a_0 \neq 0. \tag{5.14}$$

We observe that, if $\alpha = -a_1/(4a_0)$, then $P'''(\alpha) = 0$, hence

$$P(z) = a_0 (z - \alpha)^4 + \frac{P''(\alpha)}{2}(z - \alpha)^2 + P'(\alpha)(z - \alpha) + P(\alpha), \tag{5.15}$$

and $z$ solves (5.14) if and only if $y := z - \alpha$ solves
$a_0 y^4 + \frac{P''(\alpha)}{2} y^2 + P'(\alpha) y + P(\alpha) = 0$, see (5.15). Therefore it
suffices to solve

$$z^4 + pz^2 + qz + r = 0, \qquad p, q, r \in \mathbb{R}. \tag{5.16}$$

We look for solutions of the type $z = u + v + w$. Inserting into the equation
we find

$$z^4 - 2(u^2 + v^2 + w^2)z^2 - 8uvw\,z + (u^2 + v^2 + w^2)^2 - 4(u^2v^2 + u^2w^2 + v^2w^2) = 0,$$

therefore $z = u + v + w$ is a solution if

$$u^2 + v^2 + w^2 = -p/2, \qquad uvw = -q/8, \qquad u^2v^2 + u^2w^2 + v^2w^2 = (p^2 - 4r)/16.$$

By computation, see Exercise 5.61, $u^2$, $v^2$ and $w^2$ are the three solutions
$y_1, y_2, y_3$ of the third degree equation

$$y^3 + \frac{p}{2}y^2 + \frac{p^2 - 4r}{16}y - \frac{q^2}{64} = 0.$$

Consequently $u, v, w$ are to be chosen among the square roots $\pm u_0$, $\pm v_0$,
$\pm w_0$ of $y_1, y_2, y_3$. If we choose $w_0$ in such a way that $u_0 v_0 w_0 = -q/8$, then
we conclude that

$$z_1 = u_0 + v_0 + w_0, \quad z_2 = u_0 - v_0 - w_0, \quad z_3 = -u_0 + v_0 - w_0, \quad z_4 = -u_0 - v_0 + w_0$$

are the four solutions of equation (5.16).



5.24 Solutions by radicals. The study of algebraic equations and,
especially, the search for a procedure for solving algebraic equations, i.e.,
finding the roots of a given equation from its coefficients by means of a
finite number of rational operations and extraction of radicals, continued till
the end of the eighteenth century. In 1770 a fundamental work by
Joseph-Louis Lagrange (1736-1813) appeared in the Nouv. Mém. de l'Acad. de
Berlin. There he analyzed the methods for solving equations of degree at
most 4 and set a new basis for the study of higher order equations. In
1799 Carl Friedrich Gauss (1777-1855) provided a first rigorous proof of
the fundamental theorem of algebra (incomplete proofs had been given
by Jean d'Alembert (1717-1783), Leonhard Euler (1707-1783), and Gauss
himself). From 1799, with the treatise Teoria generale delle equazioni in
cui si dimostra impossibile la soluzione algebrica delle equazioni di grado
superiore al quarto, until 1813, Paolo Ruffini (1765-1822) made several
attempts to prove that general equations of degree higher than four could
not be solved by radicals. Finally in 1824 the Mémoire sur les équations
algébriques, où l'on démontre l'impossibilité de la résolution de l'équation
générale du cinquième degré by Niels Henrik Abel (1802-1829) appeared,
where a complete proof of Ruffini's claim was given.

Figure 5.10. Niels Henrik Abel (1802-1829) and the frontispiece of a Memoria
by Paolo Ruffini (1765-1822).

The problem then became that of deciding whether a specific equation 
was or was not solvable by radicals, and, if not, finding more complicated 
formulas: the fundamental ideas in this direction are due to Evariste Ga- 
lois (1811-1832) with further contributions, among others, by Enrico Betti 
(1823-1892), Charles Hermite (1822-1901), Leopold Kronecker (1823- 
1891) which led to the in some sense definitive treatise by Camille Jordan 
(1838-1922). But this would lead us far away from our main path. 



5.2.2 Distribution of the roots of a 
polynomial 



Let $P$ be a polynomial of degree $n$ with real coefficients, $P(x) = \sum_{k=0}^{n} a_k x^k$.
Without solving the equation P{x) = 0 we would like to obtain information 
about the distribution of its roots. For instance, we would like to determine 
whether it has real roots, and if it does, how many; or how many positive 
roots it has, or how many real roots lie between given limits a and b. 
a. Descartes's law of signs

In his book Geometria Rene Descartes (1596-1650) proved the following. 

5.25 Theorem. Let $P(x)$ be a real polynomial. If all its roots are real,
then the number of its positive roots, counted with multiplicity, is equal 
to the number of changes of sign in the sequence of its coefficients. 

The number of changes of sign is defined as follows: we list all coefficients
in order of decreasing power index, including $a_n$ and $a_0$, cancel
out the zeros, and consider all pairs of successive numbers in the list so 
obtained: if in such a pair the signs of the numbers are different, then we 
call this a change of sign. 

Proof. Of course we can assume $a_n > 0$. We start by observing that the sign of the first
nonzero coefficient of $P$ (the one of lowest degree) is $(-1)^p$, $p$ being the number of
positive roots of $P$, counted with their multiplicities. We can in fact write

$$P(x) = a_n x^n + \dots + a_k x^k = a_n x^k (x - x_1) \cdots (x - x_p)(x - x_{p+1}) \cdots (x - x_{n-k})$$

if $P$ has 0 as a root of multiplicity $k$, $x_1, x_2, \dots, x_p$ are the positive roots of $P$ and
$x_{p+1}, \dots, x_{n-k}$ are the negative ones.

The proof now proceeds by induction on the degree of $P$. The claim is trivial for
$n = 1$. Let us suppose it for polynomials of degree $n - 1$ and consider a polynomial
$P(x) = \sum_{k=0}^{n} a_k x^k$ of degree $n$.

If $a_0 = 0$, then $P(x) = xQ(x)$. Since the number of positive roots and the number
of changes of sign of $P$ and $Q$ are equal and the claim holds for $Q$, it holds for $P$.

Suppose $a_0 \neq 0$. It is clear that the number of changes of sign in $P(x)$ is equal to the
analogous number for the derivative $P'(x)$, if the sign of $a_0$ and of the last coefficient of
$P'$ coincide, or it is one more if the signs are opposite. By the remark at the beginning
of the proof, in the first case the numbers of positive roots of P and P' have the same 
parity (are both even or odd), and in the second case they have opposite parity. On 
the other hand the number of positive roots of P can be either equal to the number of 
positive roots of P', or one more. Therefore in both cases the difference between the 
changes of sign and the number of roots of P and P' is the same. 

Since the number of positive roots of P' is equal to the number of changes of sign 
in the coefficients of P', by inductive assumption, the claim is proved also for P. □ 

5.26 Remark. Actually one could also show: if P{x) has also complex roots, then the 
number of positive roots is equal to, or an even number less, than the number of changes 
in sign in the coefficients. 
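Counting the changes of sign as defined above is a short computation. The sketch below assumes the coefficients are listed in order of decreasing power; `sign_changes` is an illustrative name.

```python
def sign_changes(coeffs):
    """Number of sign changes in a coefficient list, zeros cancelled out."""
    nonzero = [c for c in coeffs if c != 0]
    return sum(1 for a, b in zip(nonzero, nonzero[1:]) if (a > 0) != (b > 0))

# (x - 1)(x - 2)(x + 3) = x^3 - 7x + 6 has only real roots, two of them positive
print(sign_changes([1, 0, -7, 6]))  # 2
# x^2 + 1 has no real roots and no changes of sign
print(sign_changes([1, 0, 1]))      # 0
```

In the first example all roots are real, so the count agrees with the number of positive roots, as Theorem 5.25 states.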



b. Sturm’s theorem 

Descartes’s law of signs does not give an answer to our initial question to 
know the number of real roots of a given polynomial in a given interval. 
After many attempts, this problem was solved by Jacques Charles François
Sturm (1803-1855) in 1835. He considers the problem of localizing the
sets of zeros of a polynomial disregarding multiplicities. For this problem
it suffices to consider the case of polynomials with only simple roots, since
for any polynomial $P$, $P_0 := P(x)/$g.c.d. $(P(x), P'(x))$ has the same set of
roots as $P$, and $P_0$ and $P_0'$ are coprime. So we can and do assume from
now on that $P$ and $P'$ are coprime.
We now apply Euclid's algorithm to $P$ and $P'$ with a slightly different
notation to find the list of polynomials $\{Q_k\}$ given by

$$Q_0(x) = P(x), \quad Q_1(x) = P'(x), \quad Q_{k-1}(x) = q_k(x)Q_k(x) - Q_{k+1}(x), \quad \deg Q_{k+1} < \deg Q_k, \tag{5.17}$$

till the last one, $Q_\ell(x)$, that is nonzero. $P$ and $P'$ being coprime, $Q_\ell(x)$ is
a nonzero constant. The list of polynomials

$$\{Q_0(x), Q_1(x), \dots, Q_\ell(x)\}$$

is called the Sturm's sequence of $P$.

For $a \in \mathbb{R}$ we denote by $V(a)$ the number of changes of sign in the list

$$\{Q_0(a), Q_1(a), \dots, Q_\ell(a)\}$$

not counting possible zeros.

5.27 Theorem (Sturm). The number of roots of $P$ in $[a, b]$ is equal to
$V(a) - V(b)$.

Proof. The idea is to look at how $V(\xi)$ changes when $\xi$ moves from $a$
to $b$. Let $I$ be the interval between two consecutive roots of the $Q_i$'s,
$i = 0, \dots, \ell - 1$. By continuity sgn $(Q_i(\xi))$ is constant in $I$, from which we
infer $V(\xi)$ is constant in $I$. Thus $V(\xi)$ is a piecewise constant function on
$[a, b]$ that possibly jumps at the roots of one of the $Q_i$'s.

Let us now look at $V(\xi)$ when $\xi$, moving from $a$ to $b$, passes through
one of the roots of the polynomials $Q_0, \dots, Q_{\ell-1}$. First we observe that if
$Q_i(\xi) = 0$, then $Q_{i-1}(\xi)$ and $Q_{i+1}(\xi)$ are nonzero, since $Q_j(\xi) = Q_{j+1}(\xi) = 0$
and (5.17) would give $Q_{j+2}(\xi) = 0, \dots, Q_\ell(\xi) = 0$: a contradiction.

(i) Assume then

$$Q_0(\gamma) = 0, \quad Q_1(\gamma) \neq 0, \quad \dots, \quad Q_{\ell-1}(\gamma) \neq 0, \quad Q_\ell(\gamma) \neq 0.$$

The continuity of the $Q_i$'s yields a $\delta > 0$ such that

$$\text{sgn}\, Q_i(\gamma - \delta) = \text{sgn}\, Q_i(\gamma + \delta) \qquad \text{for } i = 1, \dots, \ell.$$

Since $Q_1(\gamma) \neq 0$, $Q_0$ is monotone in a neighborhood of $\gamma$, we may assume

$$\text{sgn}\, Q_0(\gamma - \delta) = -\text{sgn}\, Q_1(\gamma - \delta), \qquad \text{sgn}\, Q_0(\gamma + \delta) = \text{sgn}\, Q_1(\gamma + \delta).$$

In this case, we therefore conclude

$$V(\gamma - \delta) - V(\gamma + \delta) = 1.$$

(ii) If instead $\gamma$ is a root of

$$Q_i(x) = 0 \qquad \text{for some } i = 1, 2, \dots, \ell - 1,$$

then (5.17) yields $Q_{i-1}(\gamma) = -Q_{i+1}(\gamma)$, therefore $Q_{i-1}$ and $Q_{i+1}$ have
opposite sign in a neighborhood of $\gamma$, say $[\gamma - \delta, \gamma + \delta]$, since they are not
zero. Hence the following two tables are possible
    x          γ − δ     γ     γ + δ
    Q_{i−1}      +       +       +
    Q_i          ±       0       ±
    Q_{i+1}      −       −       −

    x          γ − δ     γ     γ + δ
    Q_{i−1}      −       −       −
    Q_i          ±       0       ±
    Q_{i+1}      +       +       +


In this case we then conclude that

$$V(\gamma - \delta) - V(\gamma + \delta) = 0.$$

From (i) and (ii), we infer that $V(\xi)$ jumps at $\gamma$ if and only if $\gamma$ is a root
of $Q_0$, and the jump is $+1$, thus concluding that $V(a) - V(b)$ equals the
number of zeros of $Q_0 = P$. □
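Sturm's theorem translates directly into a root-counting procedure. The following is a minimal sketch under stated assumptions: $P$ has rational coefficients listed from the highest degree down, $P$ and $P'$ are coprime, and neither endpoint is a root of $P$. All function names are illustrative.

```python
from fractions import Fraction

def poly_rem(a, b):
    """Remainder of a divided by b (coefficients in decreasing degree)."""
    a = [Fraction(c) for c in a]
    while len(a) >= len(b):
        factor = a[0] / b[0]
        for i in range(len(b)):
            a[i] -= factor * b[i]
        a.pop(0)                      # leading coefficient is now zero
    while a and a[0] == 0:            # strip remaining leading zeros
        a.pop(0)
    return a

def derivative(p):
    n = len(p) - 1
    return [c * (n - i) for i, c in enumerate(p[:-1])]

def evaluate(p, x):
    result = Fraction(0)
    for c in p:
        result = result * x + c
    return result

def sturm_sequence(p):
    """Q_0 = P, Q_1 = P', Q_{k+1} = -(remainder of Q_{k-1} by Q_k)."""
    seq = [[Fraction(c) for c in p], [Fraction(c) for c in derivative(p)]]
    while True:
        r = poly_rem(seq[-2], seq[-1])
        if not r:
            break
        seq.append([-c for c in r])
    return seq

def sign_changes_at(seq, x):
    """V(x): changes of sign in Q_0(x), ..., Q_l(x), zeros not counted."""
    signs = [v > 0 for v in (evaluate(q, x) for q in seq) if v != 0]
    return sum(1 for s, t in zip(signs, signs[1:]) if s != t)

def count_roots(p, a, b):
    seq = sturm_sequence(p)
    return sign_changes_at(seq, a) - sign_changes_at(seq, b)

# P(x) = x^3 - 7x + 6 = (x - 1)(x - 2)(x + 3)
print(count_roots([1, 0, -7, 6], 0, 3))   # 2
print(count_roots([1, 0, -7, 6], -4, 0))  # 1
```

Exact rational arithmetic is used so that the signs in the sequence are not affected by rounding.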



5.3 Rational Functions 

5.28 Definition. A rational function $R(z)$ is the quotient $R(z) =
A(z)/B(z)$ of two polynomial functions $A, B : \mathbb{C} \to \mathbb{C}$. $R(z)$ is therefore
defined for all $z \in \mathbb{C}$ such that $B(z) \neq 0$.



a. Decomposition in C 

Hermite's formula allows us to decompose every rational function $A(z)/B(z)$
as the sum of a polynomial (which is nonzero if and only if $\deg A \ge \deg B$)
and of simple rational functions, that is of functions of the type

$$\frac{\lambda}{(z - \alpha)^k}$$

where $\lambda \in \mathbb{C}$, $k \in \mathbb{N}$, and $\alpha$ is a root of $B$.

Of course we can assume that $A$ and $B$ are coprime. Also, if $\deg A \ge \deg B$,
we may divide $A$ by $B$ obtaining $A(z) = B(z)Q(z) + R(z)$ with
$\deg R(z) < \deg B(z)$, i.e.,

$$\frac{A(z)}{B(z)} = Q(z) + \frac{R(z)}{B(z)}.$$

Therefore in the sequel of this section we shall always assume that $A$ and
$B$ are coprime and $\deg A < \deg B$.

Let $\alpha_1, \alpha_2, \dots, \alpha_p$ be the roots of $B$ with multiplicities $k_1, k_2, \dots, k_p$,
so that

$$B(z) = (z - \alpha_1)^{k_1}(z - \alpha_2)^{k_2} \cdots (z - \alpha_p)^{k_p}, \qquad k_1 + k_2 + \dots + k_p = n.$$

Fix one of the roots, denote it by $\alpha$ and denote its multiplicity by $k$, so
that

$$B(z) = (z - \alpha)^k Q_\alpha(z), \qquad Q_\alpha(\alpha) \neq 0,$$

while $Q_\alpha(\beta) = 0$ for any root $\beta$ of $B$ with $\beta \neq \alpha$. Notice also that, by
Taylor's formula,

$$Q_\alpha(\alpha) = \frac{D^kB(\alpha)}{k!}.$$



Hermite’s decomposition formula is a consequence of the following 
lemma, which allows us to decrease the degree of the denominator. 



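5.29 Lemma. Let $\alpha$ be a root of $B$ with multiplicity $k$, and let $Q_\alpha(z)$ be
such that $B(z) = (z - \alpha)^k Q_\alpha(z)$. We have

$$\frac{A(z)}{B(z)} = \frac{\lambda_{\alpha,k}}{(z - \alpha)^k} + \frac{R(z)}{(z - \alpha)^{k-1} Q_\alpha(z)} \tag{5.18}$$

where

$$\lambda_{\alpha,k} := \frac{A(\alpha)}{Q_\alpha(\alpha)} = \frac{k!\, A(\alpha)}{D^kB(\alpha)}, \qquad R(z) := \frac{A(z) - \lambda_{\alpha,k} Q_\alpha(z)}{z - \alpha}.$$

Moreover $\deg R < \deg B - 1$ and $R(z)$ and $Q_\alpha$ are coprime.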

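Proof. We have

$$\frac{A(z)}{B(z)} = \frac{\lambda Q_\alpha(z)}{B(z)} + \frac{A(z) - \lambda Q_\alpha(z)}{B(z)} = \frac{\lambda}{(z - \alpha)^k} + \frac{A(z) - \lambda Q_\alpha(z)}{(z - \alpha)^k Q_\alpha(z)}.$$

Since $A(\alpha) - \lambda Q_\alpha(\alpha) = 0$ if $\lambda = \lambda_{\alpha,k}$, we have $A(z) - \lambda_{\alpha,k} Q_\alpha(z) =
R(z)(z - \alpha)$, proving (5.18) and that $\deg R < \deg B - 1$. It remains to
prove that $R$ and $Q_\alpha$ are coprime. In fact, if $\beta$ is a common root to $R$
and $Q_\alpha$, then $A(\beta) = A(\beta) - \lambda_{\alpha,k} Q_\alpha(\beta) = R(\beta)(\beta - \alpha) = 0$, hence $\beta$ is a
common root to $A$ and $B$, a contradiction. □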

Iterating Lemma 5.29 we get the following. 

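5.30 Theorem. Let $A, B$ be coprime polynomials with $\deg A < \deg B$
and let $\alpha$ be a root of $B$ with multiplicity $k$. Then we can find $\lambda_{\alpha,k}$,
$\lambda_{\alpha,k-1}, \dots, \lambda_{\alpha,1} \in \mathbb{C}$ and a polynomial $R_\alpha$ coprime with $Q_\alpha(z)$ with
$\deg R_\alpha < \deg B - k$, such that

$$\frac{A(z)}{B(z)} = \sum_{j=1}^{k} \frac{\lambda_{\alpha,j}}{(z - \alpha)^j} + \frac{R_\alpha(z)}{Q_\alpha(z)}.$$

Actually $\{\lambda_{\alpha,j}\}$ and $R_\alpha$ can be computed as follows: Let $Q_\alpha(z)$ be such
that $B(z) = (z - \alpha)^k Q_\alpha(z)$, $Q_\alpha(\alpha) \neq 0$. Set $R_{\alpha,k}(z) := A(z)$, compute
iteratively for $j = k, \dots, 2, 1$,

$$\lambda_{\alpha,j} := R_{\alpha,j}(\alpha)/Q_\alpha(\alpha), \qquad R_{\alpha,j-1}(z) := \big(R_{\alpha,j}(z) - \lambda_{\alpha,j} Q_\alpha(z)\big)/(z - \alpha),$$

and set $R_\alpha(z) := R_{\alpha,0}(z)$.

Finally, observe that, for any root $\beta \neq \alpha$, we have

$$R_\alpha(\beta) = \frac{A(\beta)}{(\beta - \alpha)^k}. \tag{5.19}$$

Figure 5.11. Charles Hermite (1822-1901).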



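5.31 Theorem (Hermite). Let $A, B$ be coprime polynomials in $\mathbb{C}[z]$
with $\deg A < \deg B =: n$. Let $k(\alpha)$ be the multiplicity of the root
$\alpha$ of $B$. Then we can find uniquely determined complex numbers $\lambda_{\alpha,j}$,
$j = 1, \dots, k(\alpha)$, such that

$$\frac{A(z)}{B(z)} = \sum_{\alpha \text{ root of } B}\ \sum_{j=1}^{k(\alpha)} \frac{\lambda_{\alpha,j}}{(z - \alpha)^j} \tag{5.20}$$

where the $\lambda_{\alpha,j}$ can be computed as follows: let $Q_\alpha(z)$ be such that
$B(z) = (z - \alpha)^{k(\alpha)} Q_\alpha(z)$, $Q_\alpha(\alpha) \neq 0$. Set $R_{\alpha,k(\alpha)}(z) := A(z)$, then compute
iteratively for $j = k(\alpha), \dots, 2, 1$

$$\lambda_{\alpha,j} := R_{\alpha,j}(\alpha)/Q_\alpha(\alpha), \qquad R_{\alpha,j-1}(z) := \big(R_{\alpha,j}(z) - \lambda_{\alpha,j} Q_\alpha(z)\big)/(z - \alpha).$$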



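Proof. Uniqueness. Let $\alpha$ be a root of $B$ with multiplicity $k_\alpha$. Multiplying
(5.20) by $(z - \alpha)^{k_\alpha}$, we characterize $\lambda_{\alpha,k_\alpha}$ as

$$\lambda_{\alpha,k_\alpha} = \lim_{z \to \alpha} \frac{A(z)(z - \alpha)^{k_\alpha}}{B(z)} = \frac{A(\alpha)}{Q_\alpha(\alpha)}.$$
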

Proceeding iteratively, the uniqueness of the decomposition follows. 
Existence. The existence of the decomposition follows applying Theo- 
rem 5.30 to the roots of B{z) ordered in an arbitrary order. Moreover, 
because of the uniqueness, we get the same decomposition starting from 
an arbitrary root, hence the algorithm. □ 
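5.32 Remark. If the roots of $B$ are distinct, Hermite's formula becomes particularly simple. Let $A$ and $B$ be coprime and $\deg A < \deg B$. Denote by $\alpha_1, \alpha_2, \dots, \alpha_n$ the roots of $B$ and set

\[
Q_i(z) := Q_{\alpha_i}(z) = B(z)/(z-\alpha_i)
\]

so that

\[
Q_i(\alpha_i) = B'(\alpha_i), \qquad Q_i(\alpha_j) = 0 \ \text{ for } j \neq i; \tag{5.21}
\]

then we have

\[
\frac{A(z)}{B(z)} = \sum_{j=1}^{n} \frac{\lambda_j}{z-\alpha_j}
\qquad\text{where}\qquad
\lambda_j = \frac{A(\alpha_j)}{Q_j(\alpha_j)} = \frac{A(\alpha_j)}{B'(\alpha_j)}.
\]

A direct proof of Hermite's formula in this case can be done as follows. Write $1/(z-\alpha_j) = Q_j(z)/B(z)$ so that

\[
\sum_{j=1}^{n} \frac{\lambda_j}{z-\alpha_j} = \frac{\sum_{j=1}^{n} \lambda_j Q_j(z)}{B(z)}. \tag{5.22}
\]

The degree of the polynomial $C(z) := \sum_{j=1}^n \lambda_j Q_j(z)$ is less than $n$, moreover for all $i = 1, \dots, n$ (5.21) yields

\[
C(\alpha_i) = \lambda_i Q_i(\alpha_i) = \frac{A(\alpha_i)}{Q_i(\alpha_i)}\, Q_i(\alpha_i) = A(\alpha_i).
\]

Since $C$ and $A$ agree on $n$ points, we have $C = A$ and the claim follows from (5.22).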
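For simple roots the formula $\lambda_j = A(\alpha_j)/B'(\alpha_j)$ is easy to try out numerically. The following Python sketch is illustrative only; the coefficient-list representation (lowest degree first) and the helper names are our assumptions, and the roots are supplied by hand.

```python
def poly_eval(p, z):
    """Evaluate p[0] + p[1]*z + ... by Horner's rule."""
    acc = 0
    for c in reversed(p):
        acc = acc * z + c
    return acc


def poly_derivative(p):
    """Coefficients of the derivative of p (lowest degree first)."""
    return [k * c for k, c in enumerate(p)][1:]


def simple_root_coefficients(A, B, roots):
    """lambda_j = A(alpha_j) / B'(alpha_j) for each simple root alpha_j of B."""
    dB = poly_derivative(B)
    return [poly_eval(A, r) / poly_eval(dB, r) for r in roots]


# (z + 2) / (z^3 - z), with simple roots 0, 1, -1
A = [2, 1]
B = [0, -1, 0, 1]
roots = [0, 1, -1]
lams = simple_root_coefficients(A, B, roots)

# compare both sides of the decomposition at a point that is not a root
z0 = 0.3 + 0.7j
lhs = poly_eval(A, z0) / poly_eval(B, z0)
rhs = sum(lam / (z0 - r) for lam, r in zip(lams, roots))
```

Here `lhs` and `rhs` should coincide up to rounding, which is the content of the remark for this particular choice of $A$ and $B$.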



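5.33 Example. Suppose we want to decompose

\[
\frac{1}{(z^2+1)(z-1)^3}.
\]

We start with the root $1$ with multiplicity $3$. In the algorithm of Theorem 5.30, $A(z) = 1$, $Q_1(z) = z^2+1$ and $R_3(z) = A(z) = 1$. Then we compute

\[
\begin{aligned}
\lambda_{1,3} &= \frac{R_3(1)}{Q_1(1)} = \frac12, \\
R_2(z) &= \big(R_3(z) - \tfrac12 Q_1(z)\big) : (z-1) = \big(1 - \tfrac12(z^2+1)\big) : (z-1) = -\tfrac12 (z^2-1) : (z-1) = -\tfrac12 (z+1), \\
\lambda_{1,2} &= R_2(1)/Q_1(1) = -\frac12, \\
R_1(z) &= \big(R_2(z) + \tfrac12 (z^2+1)\big) : (z-1) = \big(-\tfrac12 (z+1) + \tfrac12 (z^2+1)\big) : (z-1) = \tfrac12 z(z-1) : (z-1) = \tfrac12 z, \\
\lambda_{1,1} &= R_1(1)/Q_1(1) = \frac14, \\
R(z) = R_0(z) &= \big(R_1(z) - \tfrac14 (z^2+1)\big) : (z-1) = -\tfrac14 (z-1)^2 : (z-1) = -\tfrac14 (z-1),
\end{aligned}
\]

finding

\[
\frac{1}{(z^2+1)(z-1)^3} = \frac12 \frac{1}{(z-1)^3} - \frac12 \frac{1}{(z-1)^2} + \frac14 \frac{1}{z-1} - \frac{z-1}{4(z^2+1)}.
\]

It remains to decompose

\[
-\frac{z-1}{4(z^2+1)}.
\]

As $z^2+1 = (z-i)(z+i)$, the roots of the denominator are $i$ and $-i$ with multiplicity 1. Therefore it suffices to compute

\[
\lambda_{i,1} = \frac{-(z-1)/4}{z+i} \ \text{ for } z = i, \qquad
\lambda_{-i,1} = \frac{-(z-1)/4}{z-i} \ \text{ for } z = -i,
\]

that is $\lambda_{i,1} = -\frac18 (1+i)$, $\lambda_{-i,1} = -\frac18 (1-i)$, to conclude

\[
\frac{1}{(z^2+1)(z-1)^3} = \frac12 \frac{1}{(z-1)^3} - \frac12 \frac{1}{(z-1)^2} + \frac14 \frac{1}{z-1} - \frac18 \frac{1+i}{z-i} - \frac18 \frac{1-i}{z+i}.
\]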

Alternatively, we can proceed as follows. From Hermite's rule we know the existence and uniqueness of a decomposition

\[
\frac{1}{(z^2+1)(z-1)^3} = \frac{a}{(z-1)^3} + \frac{b}{(z-1)^2} + \frac{c}{z-1} + \frac{d}{z-i} + \frac{e}{z+i}
\]

for suitable $a, b, c, d, e \in \mathbb{C}$. We can compute those coefficients by reducing to the common denominator. This way the polynomials in the numerators have to be equal, hence their coefficients have to be equal, by the principle of identity of polynomials. This yields a system of five linear equations in $a, b, c, d, e$ that, once solved, yields the values of $a, b, c, d, e$.



b. Decomposition in R 

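If $A(x), B(x) \in \mathbb{R}[x]$ are two polynomials with real coefficients, one can decompose $A(x)/B(x)$ as a sum of a polynomial (that is not zero if and only if $\deg A \ge \deg B$) and of simple rational functions with real coefficients. In fact, recalling that nonreal roots of $B$ come in couples of conjugate complex numbers, by the complex Hermite's formula, Theorem 5.31, we infer

\[
\frac{A(z)}{B(z)} = \sum_{\substack{\alpha \text{ root of } B \\ \alpha \in \mathbb{R}}} \sum_{j=1}^{k(\alpha)} \frac{\lambda_{\alpha,j}}{(z-\alpha)^j}
+ \sum_{\substack{\alpha \text{ root of } B \\ \Im(\alpha) > 0}} \sum_{j=1}^{k(\alpha)} \frac{\lambda_{\alpha,j}}{(z-\alpha)^j}
+ \sum_{\substack{\alpha \text{ root of } B \\ \Im(\alpha) > 0}} \sum_{j=1}^{k(\alpha)} \frac{\lambda_{\bar\alpha,j}}{(z-\bar\alpha)^j} \tag{5.23}
\]

for all $z \in \mathbb{C}$ with $B(z) \neq 0$. Going through the iterative scheme in Theorem 5.31 which yields the $\lambda_{\alpha,j}$, taking into account that $A$ and $B$ have real coefficients, we infer

\[
Q_{\bar\alpha}(\bar z) = \overline{Q_\alpha(z)}, \qquad \lambda_{\bar\alpha,j} = \overline{\lambda_{\alpha,j}},
\]

hence from (5.23) the Hermite decomposition formula in $\mathbb{R}$

\[
\frac{A(x)}{B(x)} = \sum_{\substack{\alpha \text{ root of } B \\ \alpha \in \mathbb{R}}} \sum_{j=1}^{k(\alpha)} \frac{\lambda_{\alpha,j}}{(x-\alpha)^j}
+ \sum_{\substack{\alpha \text{ root of } B \\ \Im(\alpha) > 0}} \sum_{j=1}^{k(\alpha)} 2\,\Re\Big(\frac{\lambda_{\alpha,j}}{(x-\alpha)^j}\Big) \tag{5.24}
\]

for $x \in \mathbb{R}$ with $B(x) \neq 0$.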

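Notice that if all roots of $B$ are simple, formula (5.24) reduces to

\[
\frac{A(x)}{B(x)} = \sum_{\substack{\alpha \text{ root of } B \\ \alpha \in \mathbb{R}}} \frac{\lambda_\alpha}{x-\alpha}
+ \sum_{\substack{\alpha \text{ root of } B \\ \Im(\alpha) > 0}} \Big( \frac{\lambda_\alpha}{x-\alpha} + \frac{\overline{\lambda_\alpha}}{x-\bar\alpha} \Big)
\]

where $\lambda_\alpha := A(\alpha)/B'(\alpha)$ for every root $\alpha$ of $B$. Therefore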



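5.34 Corollary. Let $A(x)$ and $B(x) \in \mathbb{R}[x]$ be coprime real polynomials with $\deg A < \deg B$. Suppose that $B(x)$ has only simple roots. Then

\[
\frac{A(x)}{B(x)} = \sum_{\substack{\alpha \text{ root of } B \\ \alpha \in \mathbb{R}}} \frac{\lambda_\alpha}{x-\alpha}
+ \sum_{\substack{\alpha \text{ root of } B \\ \Im(\alpha) > 0}} \frac{\ell_\alpha (x - p_\alpha) + m_\alpha}{x^2 - 2p_\alpha x + q_\alpha}
\]

where $\lambda_\alpha := A(\alpha)/B'(\alpha)$ for every root $\alpha \in \mathbb{C}$ of $B$ and, if $\Im(\alpha) > 0$,

\[
p_\alpha := \Re(\alpha), \qquad q_\alpha := |\alpha|^2, \qquad \ell_\alpha := 2\,\Re(\lambda_\alpha), \qquad m_\alpha := -2\,\Im(\lambda_\alpha)\,\Im(\alpha).
\]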
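As an illustration of Corollary 5.34, the sketch below turns a complex coefficient $\lambda_\alpha$ at a nonreal simple root into the real data $p_\alpha, q_\alpha, \ell_\alpha, m_\alpha$. The function name `real_pair_coefficients` and the test fraction $1/((x-1)(x^2+1))$ are our choices, not taken from the text.

```python
def real_pair_coefficients(lam, alpha):
    """Real data (p, q, ell, m) of the term (ell*(x - p) + m) / (x^2 - 2 p x + q)."""
    p = alpha.real
    q = abs(alpha) ** 2
    ell = 2 * lam.real
    m = -2 * lam.imag * alpha.imag
    return p, q, ell, m


# 1 / ((x - 1)(x^2 + 1)): B(x) = x^3 - x^2 + x - 1, B'(x) = 3x^2 - 2x + 1
def dB(x):
    return 3 * x * x - 2 * x + 1


lam_real = 1 / dB(1)   # coefficient at the real root 1
lam_i = 1 / dB(1j)     # coefficient at the root i (positive imaginary part)
p, q, ell, m = real_pair_coefficients(lam_i, 1j)

x0 = 2.5               # any real point with B(x0) != 0
lhs = 1 / ((x0 - 1) * (x0 * x0 + 1))
rhs = lam_real / (x0 - 1) + (ell * (x0 - p) + m) / (x0 * x0 - 2 * p * x0 + q)
```

For this fraction one finds $\lambda_1 = 1/2$ and, for the pair $\pm i$, the real term $-\tfrac12 (x+1)/(x^2+1)$.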

c. Integration of rational functions 

Of course, Hermite’s decomposition allows us to integrate and express 
the indefinite integral of every rational function in terms of elementary 
functions. Here we just state the following. 

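5.35 Proposition. Under the assumptions and with the notation of Corollary 5.34, we have

\[
\int \frac{A(x)}{B(x)}\,dx = \sum_{\substack{\alpha \text{ root of } B \\ \alpha \in \mathbb{R}}} \lambda_\alpha \log|x-\alpha|
+ \sum_{\substack{\alpha \text{ root of } B \\ \Im(\alpha) > 0}} \Big( \Re(\lambda_\alpha) \log(x^2 - 2p_\alpha x + q_\alpha) - 2\,\Im(\lambda_\alpha) \arctan\Big(\frac{x - \Re(\alpha)}{\Im(\alpha)}\Big) \Big).
\]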



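5.36 Example. Let us compute

\[
\int_{-\infty}^{+\infty} \frac{x^{2m}}{1+x^{2n}}\,dx, \qquad m, n \in \mathbb{N},\ m < n.
\]

The roots of $B(x) = 1 + x^{2n}$ are the $2n$-th roots of $-1$,

\[
\beta_j := \exp\Big(i\pi\,\frac{2j+1}{2n}\Big), \qquad j = 0, \dots, 2n-1.
\]

The roots with positive imaginary part are $\beta_j$, $j = 0, \dots, n-1$. According to Proposition 5.35, we need to compute the numbers $\mu_j := x^{2m}/D(1+x^{2n})$ at $x = \beta_j$. Since $\beta_j^{2n} = -1$,

\[
\mu_j = \frac{\beta_j^{2m}}{2n\,\beta_j^{2n-1}} = -\frac{1}{2n}\,\beta_j^{2m+1} = -\frac{1}{2n}\exp\big(i\alpha(2j+1)\big), \qquad \alpha := \frac{2m+1}{2n}\,\pi.
\]

We remark that

\[
\alpha(2j+1) + \alpha\big(2(n-1-j)+1\big) = \pi(2m+1),
\]

i.e., $\arg(\mu_j) + \arg(\mu_{n-1-j}) = \pi$ modulo $2\pi$. Also, since $|\mu_j| = 1/(2n)$ $\forall j$, we have $\mu_{n-1-j} = -\overline{\mu_j}$. Finally, taking into account that $|\beta_j| = 1$ and $p_{n-1-j} = \Re(\beta_{n-1-j}) = -\Re(\beta_j) = -p_j$, Proposition 5.35 yields

\[
\int_{-\infty}^{+\infty} \frac{x^{2m}}{1+x^{2n}}\,dx
= \Big[ \sum_{j=0}^{n-1} \Re(\mu_j) \log(x^2 - 2p_j x + 1) \Big]_{-\infty}^{+\infty}
- 2 \Big[ \sum_{j=0}^{n-1} \Im(\mu_j) \arctan\Big(\frac{x - p_j}{\Im(\beta_j)}\Big) \Big]_{-\infty}^{+\infty}
= -2\pi \sum_{j=0}^{n-1} \Im(\mu_j) = -2\pi\,\Im\Big( \sum_{j=0}^{n-1} \mu_j \Big). \tag{5.25}
\]

Here the logarithmic terms give no contribution: the terms of indices $j$ and $n-1-j$ combine into $\Re(\mu_j) \log\big((x^2 - 2p_j x + 1)/(x^2 + 2p_j x + 1)\big)$, which tends to $0$ as $x \to \pm\infty$, while each arctangent increases by $\pi$.

It remains to compute $\sum_{j=0}^{n-1} \mu_j$. Set $k := \exp(i\alpha)$ and $q := \exp(i2\alpha)$, so that

\[
\sum_{j=0}^{n-1} \mu_j = -\frac{1}{2n} \sum_{j=0}^{n-1} k q^j = -\frac{k}{2n}\,\frac{1-q^n}{1-q}.
\]

Since $q^n = \exp\big(i\pi\frac{2m+1}{n}\,n\big) = \exp(i\pi(2m+1)) = \exp(i\pi) = -1$, we deduce

\[
\sum_{j=0}^{n-1} \mu_j = -\frac{1}{n}\,\frac{e^{i\alpha}}{1 - e^{i2\alpha}} = \frac{1}{n}\,\frac{1}{e^{i\alpha} - e^{-i\alpha}} = \frac{1}{2n\,i \sin\alpha} = -\frac{i}{2n \sin\alpha}. \tag{5.26}
\]

Therefore from (5.25) and (5.26) we conclude that, for $n, m \in \mathbb{N}$, $m < n$, we have

\[
\int_{-\infty}^{+\infty} \frac{x^{2m}}{1+x^{2n}}\,dx = \frac{\pi}{n \sin\alpha} = \frac{\pi}{n \sin\big(\frac{2m+1}{2n}\pi\big)}.
\]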
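The closed form of Example 5.36 can be cross-checked numerically. The sketch below is only a sanity check under our own choices: the function names are hypothetical, the residue-type sum follows (5.25), and the quadrature uses the substitution $x = \tan\theta$, which turns the integrand into the smooth $\pi$-periodic function $\sin^{2m}\theta \cos^{2(n-m-1)}\theta / (\cos^{2n}\theta + \sin^{2n}\theta)$.

```python
import cmath
import math


def closed_form(m, n):
    """pi / (n sin(alpha)) with alpha = (2m + 1) pi / (2n)."""
    alpha = (2 * m + 1) * math.pi / (2 * n)
    return math.pi / (n * math.sin(alpha))


def residue_sum(m, n):
    """-2 pi Im(sum of mu_j), mu_j = beta_j^(2m) / (2n beta_j^(2n-1))."""
    total = 0
    for j in range(n):
        beta = cmath.exp(1j * math.pi * (2 * j + 1) / (2 * n))
        total += beta ** (2 * m) / (2 * n * beta ** (2 * n - 1))
    return -2 * math.pi * total.imag


def quadrature(m, n, steps=2000):
    """Midpoint rule on (-pi/2, pi/2) after the substitution x = tan(theta)."""
    h = math.pi / steps
    acc = 0.0
    for i in range(steps):
        theta = -math.pi / 2 + (i + 0.5) * h
        s, c = math.sin(theta), math.cos(theta)
        acc += s ** (2 * m) * c ** (2 * (n - m - 1)) / (c ** (2 * n) + s ** (2 * n))
    return acc * h


values = [(closed_form(m, n), residue_sum(m, n), quadrature(m, n))
          for (m, n) in [(0, 1), (1, 2), (2, 5)]]
```

For $m = 0$, $n = 1$ all three numbers reduce to $\int_{-\infty}^{+\infty} dx/(1+x^2) = \pi$, and for $m = 1$, $n = 2$ to $\pi/\sqrt2$.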



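5.37 ¶ Following the same path of the Hermite formula for complex rational functions, prove the following.

Lemma. Let $A, B$ be two polynomials with real coefficients which are coprime in $\mathbb{R}[x]$ with $\deg A < \deg B$, and let $\alpha$ be a nonreal root of $B$ of multiplicity $k$. Set $p := \Re(\alpha)$, $q := |\alpha|^2$ and $Q(z) := B(z)/(z^2 - 2pz + q)^k$. Then

\[
\frac{A(z)}{B(z)} = \frac{a(z-p) + b}{(z^2 - 2pz + q)^k} + \frac{R(z)}{(z^2 - 2pz + q)^{k-1} Q(z)}
\]

where, if $\lambda := A(\alpha)/Q(\alpha)$, then $a = \Im(\lambda)/\Im(\alpha)$, $b = \Re(\lambda)$, and

\[
R(z) := \frac{A(z) - (a(z-p) + b)\,Q(z)}{z^2 - 2pz + q}
\]

is a real polynomial with $\deg R < \deg B - 2$.

Iterating the previous lemma over $k$ and all roots of $B$, show the following.

Theorem. Let $A, B$ be two polynomials with real coefficients which are coprime in $\mathbb{R}[x]$ with $\deg A < \deg B$. Assume that $B$ has no real root and denote by $k(\alpha)$ the multiplicity of the root $\alpha$ of $B$. Then

\[
\frac{A(z)}{B(z)} = \sum_{\substack{\alpha \text{ root of } B \\ \Im(\alpha) > 0}} \sum_{j=1}^{k(\alpha)} \frac{a_{\alpha,j}(z - p_\alpha) + b_{\alpha,j}}{(z^2 - 2p_\alpha z + q_\alpha)^j}
\]

where $p_\alpha = \Re(\alpha)$, $q_\alpha = |\alpha|^2$, and the $a_{\alpha,j}$ and $b_{\alpha,j}$ can be computed by the following procedure: set $Q_\alpha(z) = B(z)/(z^2 - 2p_\alpha z + q_\alpha)^{k}$, $k = k(\alpha)$. Set $R_{\alpha,k}(z) := A(z)$ and compute for $j = k, k-1, \dots, 1$,

\[
\begin{cases}
\lambda := R_{\alpha,j}(\alpha)/Q_\alpha(\alpha), \\
a_{\alpha,j} := \Im(\lambda)/\Im(\alpha), \qquad b_{\alpha,j} := \Re(\lambda), \\
R_{\alpha,j-1}(z) := \big(R_{\alpha,j}(z) - (a_{\alpha,j}(z - p_\alpha) + b_{\alpha,j})\,Q_\alpha(z)\big)/(z^2 - 2p_\alpha z + q_\alpha).
\end{cases}
\]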

5.38 Prove the following.

Theorem. Let $A, B$ be two polynomials with real coefficients which are coprime in $\mathbb{R}[x]$ with $\deg A < \deg B$. Let $N := \text{g.c.d.}(B, B')$ and $S := B/N$. Then there exist polynomials $M$ and $R$ with real coefficients with $\deg M < \deg N$, $\deg R < \deg S$ such that

\[
\frac{A(x)}{B(x)} = \frac{d}{dx}\Big(\frac{M(x)}{N(x)}\Big) + \frac{R(x)}{S(x)}.
\]

[Hint: Use Hermite's formula (5.23).]



5.4 Sinusoidal Functions and Their Sums

The existence in nature of periodic phenomena, i.e., phenomena that recur 
after some time, attracted the attention and the interest of man and prob- 
ably was one of the main starting points for organized knowledge. Next to 
the totally fortuitous events of life, there stood out a number of more or 
less regular phenomena that were predictable. Seasons, the apparent mo- 
tion of the moon, of the sun, of fixed stars and planets could be predicted 
by everyone. On the other hand, other phenomena such as solar and lunar 
eclipses could be predicted only by a few people, usually the high priests. 

With the creation of modern science and the systematic use of calculus
to investigate reality, several periodic phenomena were studied in detail
(oscillations, vibrations, waves) and, once more, the undulatory model
became relevant in the nineteenth century in order to understand the nature
of light and electromagnetic radiation.
5.4.1 Trigonometric polynomials

An elementary periodic phenomenon is clearly the uniform circular motion, described as we know (compare, e.g., [GM1]) by the functions sine and cosine.

5.39 Definition. A sinusoidal signal, or a circular or harmonic function of pulse or angular frequency $\omega$ is a solution of the equation of simple harmonic motion $x''(t) + \omega^2 x(t) = 0$.

All harmonic functions with pulse $\omega \neq 0$ have the form

\[
a \cos \omega t + b \sin \omega t, \qquad a, b \in \mathbb{R},
\]

compare [GM1], or, if we set $A := \sqrt{a^2+b^2}$ and $\varphi$ is such that $\cos\varphi = a/\sqrt{a^2+b^2}$, $\sin\varphi := -b/\sqrt{a^2+b^2}$, all functions of the form

\[
x(t) = A \cos(\omega t + \varphi), \qquad A \ge 0.
\]

$A$ is the amplitude and $\varphi$ the phase at time $t = 0$ of the circular motion $x(t)$. As we have seen, complex notation yields simpler formulas, since, by Euler's formulas

\[
x(t) = \Re\big(A e^{i\varphi} e^{i\omega t}\big).
\]

The phase is of course defined modulo an integer multiple of $2\pi$, therefore, if we want to have a definite number, we need to fix a determination of the angle.



a. Periodic functions 

5.40 Definition. A function $f : \mathbb{R} \to \mathbb{R}$ is said to be periodic of period $T$ if $f(t+T) = f(t)$ $\forall t \in \mathbb{R}$.

If $f$ and $g$ are periodic with period $T$, then $f + g$ and $fg$ are periodic of the same period $T$; moreover if $f$ is differentiable, then $f'$ is also periodic of period $T$.

5.41 ¶ In general, the primitive of a periodic function is not periodic, for instance, $1 + \cos t$ is $2\pi$-periodic, but $t + \sin t$ is not. Show the following.

Proposition. Let $f$ be a periodic, continuous function with period $T$. Then $\int_0^t f(s)\,ds$ is periodic of period $T$ if and only if $f$ has integral mean zero over a period, $\int_0^T f(t)\,dt = 0$.

If $f$ is a periodic function of period $T$, then $f$ is also periodic with periods $kT$, $k \in \mathbb{N}$, $k \ge 1$. A sinusoidal signal with pulse $\omega$, $f(t) := A\cos(\omega t + \varphi)$ has a minimum period $T := 2\pi/\omega$, and then it is periodic with all the periods $2k\pi/\omega$, $k \in \mathbb{N}$, $k \ge 1$. Notice however that not every periodic function has a minimum period. For instance the Dirichlet function

\[
D(x) = \begin{cases} 1 & \text{if } x \in \mathbb{Q}, \\ 0 & \text{if } x \notin \mathbb{Q} \end{cases}
\]

is periodic with period $q$ for all $q \in \mathbb{Q}_+$. The "minimum period" should then be zero, but this is meaningless.



b. Trigonometric polynomials 

As we shall see in Theorem 5.55, the sum of sinusoidal signals of pulses $\omega_1$ and $\omega_2$ is periodic if and only if $\omega_1/\omega_2$ is rational, that is, if both pulses are integer multiples of a fundamental pulse.

5.42 Definition. A trigonometric polynomial of degree $n$ and period $T$ is a periodic function of the form

\[
P(t) = \frac{a_0}{2} + \sum_{k=1}^{n} \Big( a_k \cos\Big(\frac{2\pi}{T} k t\Big) + b_k \sin\Big(\frac{2\pi}{T} k t\Big) \Big), \qquad t \in \mathbb{R}, \tag{5.28}
\]

with $a_k, b_k \in \mathbb{R}$.

Notice that the quotients of the frequencies $k/T$ of the components are rational. The terminology that follows comes from acoustics. The human ear considers as harmonic the sounds that have rational quotients of pulses and, actually, with quotient $p/q$ with $p, q$ small. Then

(i) $a_0/2$, or more precisely the constant function $t \to a_0/2$, is the continuous component of $P$,

(ii) the function $t \to a_1 \cos\big(\frac{2\pi}{T} t\big) + b_1 \sin\big(\frac{2\pi}{T} t\big)$ is the fundamental harmonic of $P$; the function $t \to a_k \cos\big(\frac{2\pi}{T} k t\big) + b_k \sin\big(\frac{2\pi}{T} k t\big)$, $k \ge 2$, is the $k$-th harmonic of $P$.

If we set

\[
A_k := \begin{cases} a_0/2 & \text{if } k = 0, \\ \sqrt{a_k^2 + b_k^2} & \text{if } k = 1, \dots, n \end{cases}
\]

and $\varphi_k$ is such that

\[
\cos\varphi_k = \frac{a_k}{\sqrt{a_k^2 + b_k^2}}, \qquad \sin\varphi_k = -\frac{b_k}{\sqrt{a_k^2 + b_k^2}},
\]

we can write $P$ as

\[
P(t) = A_0 + \sum_{k=1}^{n} A_k \cos\Big(\frac{2\pi}{T} k t + \varphi_k\Big).
\]

The lists $\{A_k\}$, $k = 0, \dots, n$, and $\{\varphi_k\}$, $k = 1, \dots, n$, are called respectively the amplitude and the phase spectrum of $P$.
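The passage from the pair $(a_k, b_k)$ to amplitude and phase can be sketched as follows. The function name `amplitude_phase` is ours; `math.atan2` is used to pick the determination of $\varphi_k$ in $(-\pi, \pi]$, which is one possible choice among the values defined modulo $2\pi$.

```python
import math


def amplitude_phase(a, b):
    """Return (A, phi) with a*cos(x) + b*sin(x) = A*cos(x + phi)."""
    amplitude = math.hypot(a, b)
    phase = math.atan2(-b, a)  # cos(phi) = a/A, sin(phi) = -b/A
    return amplitude, phase


a, b = 1.0, 1.0
A, phi = amplitude_phase(a, b)

# compare the two forms of the same harmonic at a few sample points
samples = [0.0, 0.4, 1.3, 2.9]
differences = [abs(a * math.cos(x) + b * math.sin(x) - A * math.cos(x + phi))
               for x in samples]
```

For $a = b = 1$ this gives $A = \sqrt2$ and $\varphi = -\pi/4$, i.e., $\cos x + \sin x = \sqrt2 \cos(x - \pi/4)$.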
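When dealing with trigonometric polynomials, the complex notation is useful to write formulas which are handier to manipulate. In fact, taking into account Euler's formulas, we can write $P(t)$ in (5.28) as

\[
P(t) = \sum_{k=-n}^{n} c_k \exp\Big(i \frac{2\pi}{T} k t\Big), \qquad t \in \mathbb{R}, \tag{5.29}
\]

where

\[
c_k = \begin{cases}
\dfrac{a_k - i b_k}{2} = \dfrac{A_k}{2}\, e^{i\varphi_k} & \text{if } k \ge 1, \\[2mm]
\dfrac{a_0}{2} = A_0 & \text{if } k = 0, \\[2mm]
\overline{c_{-k}} & \text{if } k \le -1.
\end{cases}
\]

Observe that, while each term $c_k \exp\big(i \frac{2\pi}{T} k t\big)$ is complex, the sum $c_k \exp\big(i \frac{2\pi}{T} k t\big) + c_{-k} \exp\big(-i \frac{2\pi}{T} k t\big)$ is real for each $k \in \{-n, \dots, n\}$.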

More generally, we set the following. 

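5.43 Definition. A complex trigonometric polynomial of degree $n$ and period $T$ is a periodic function with complex values, $P : \mathbb{R} \to \mathbb{C}$, of the type

\[
P(t) = \sum_{k=-n}^{n} c_k \exp\Big(i \frac{2\pi}{T} k t\Big), \qquad t \in \mathbb{R}, \tag{5.30}
\]

where $c_k \in \mathbb{C}$ for $k = -n, \dots, n$. The vector $\{c_k\}_{-n \le k \le n} \in \mathbb{C}^{2n+1}$ is called the complex spectrum, or simply the spectrum of $P$.

The class of all complex trigonometric polynomials of degree $n$ and period $T$ is denoted by $\mathcal{P}_{n,T}$.

Notice that $f + g$ and $\lambda f \in \mathcal{P}_{n,T}$ if $f, g \in \mathcal{P}_{n,T}$ and $\lambda \in \mathbb{C}$, or, as we say, $\mathcal{P}_{n,T}$ is a complex vector space. Finally, notice that $P\big(\frac{T}{2\pi} t\big) \in \mathcal{P}_{n,2\pi}$ if $P \in \mathcal{P}_{n,T}$.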

c. Spectrum and energy identity 

By definition, a complex trigonometric polynomial is defined uniquely by its spectrum, and the surjective map $\Phi : \mathbb{C}^{2n+1} \to \mathcal{P}_{n,T}$ given by

\[
(c_{-n}, \dots, c_n) \to f(t) := \sum_{k=-n}^{n} c_k \exp\Big(i \frac{2\pi}{T} k t\Big) \tag{5.31}
\]

is linear. Thus $\mathcal{P}_{n,T}$ is a vector space of dimension less than or equal to $2n + 1$. A relevant fact is that one can compute the complex amplitudes of the harmonics from the sum of the harmonics, thus proving that $\Phi$ is injective and therefore that the dimension of $\mathcal{P}_{n,T}$ is $2n + 1$.
5.44 Proposition. Let $P(t) = \sum_{k=-n}^{n} c_k \exp\big(i \frac{2\pi}{T} k t\big) \in \mathcal{P}_{n,T}$.

(i) For all $k = -n, \dots, n$ we have

\[
c_k = \frac{1}{T} \int_0^T P(t) \exp\Big(-i \frac{2\pi}{T} k t\Big)\,dt. \tag{5.32}
\]

Consequently $\mathcal{P}_{n,T}$ has complex dimension $2n + 1$.

(ii) The energy equality holds

\[
\frac{1}{T} \int_0^T |P(t)|^2\,dt = \sum_{k=-n}^{n} |c_k|^2. \tag{5.33}
\]

The proof of Proposition 5.44 is a simple consequence of the following computation.




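5.45 Lemma. Let $k \in \mathbb{Z}$. Then

\[
\frac{1}{T} \int_0^T \exp\Big(i \frac{2\pi}{T} k t\Big)\,dt = \begin{cases} 0 & \text{if } k \neq 0, \\ 1 & \text{if } k = 0. \end{cases}
\]

Proof. If $k = 0$, we have $\exp\big(i \frac{2\pi}{T} k t\big) = 1$, hence $\frac{1}{T} \int_0^T \exp\big(i \frac{2\pi}{T} k t\big)\,dt = 1$. If $k \neq 0$, writing $\exp\big(i \frac{2\pi}{T} k t\big) = \cos\big(\frac{2\pi}{T} k t\big) + i \sin\big(\frac{2\pi}{T} k t\big)$ and noticing that $\cos\theta$ and $\sin\theta$ have zero mean over an interval of size $2\pi$, we conclude that $\frac{1}{T} \int_0^T \exp\big(i \frac{2\pi}{T} k t\big)\,dt = 0$. □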

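Introducing the Kronecker symbol $\delta_{hk}$,

\[
\delta_{hk} := \begin{cases} 1 & \text{if } h = k, \\ 0 & \text{if } h \neq k, \end{cases}
\]

Lemma 5.45 yields

\[
\frac{1}{T} \int_0^T \exp\Big(i \frac{2\pi}{T} h t\Big) \exp\Big(-i \frac{2\pi}{T} k t\Big)\,dt = \delta_{hk} \qquad \forall h, k \in \mathbb{Z}. \tag{5.34}
\]

Proof of Proposition 5.44. (i) From (5.34)

\[
\frac{1}{T} \int_0^T P(t)\, e^{-i \frac{2\pi}{T} k t}\,dt
= \sum_{h=-n}^{n} c_h\, \frac{1}{T} \int_0^T e^{i \frac{2\pi}{T} h t}\, e^{-i \frac{2\pi}{T} k t}\,dt
= \sum_{h=-n}^{n} c_h \delta_{hk} = c_k.
\]

(ii) We have

\[
|P(t)|^2 = P(t)\,\overline{P(t)}
= \Big( \sum_{k=-n}^{n} c_k e^{i \frac{2\pi}{T} k t} \Big) \Big( \sum_{h=-n}^{n} \overline{c_h}\, e^{-i \frac{2\pi}{T} h t} \Big)
= \sum_{h,k=-n}^{n} c_k \overline{c_h}\, e^{i \frac{2\pi}{T} (k-h) t}.
\]

Thus we conclude from (5.34) that

\[
\frac{1}{T} \int_0^T |P(t)|^2\,dt = \sum_{h,k=-n}^{n} c_k \overline{c_h}\, \delta_{kh} = \sum_{k=-n}^{n} c_k \overline{c_k} = \sum_{k=-n}^{n} |c_k|^2.
\]

□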
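Formulas (5.32) and (5.33) can be tested on a concrete spectrum. In the sketch below the mean over a period is replaced by an average over $2n+1$ equally spaced samples; for the integrands at hand, which only contain the frequencies $-2n, \dots, 2n$, this average reproduces the integral mean exactly (up to rounding), by the same computation as in Lemma 5.45. The helper names and the sample spectrum are our own.

```python
import cmath
import math


def trig_poly(c, T):
    """Build P(t) = sum_{k=-n}^{n} c_k exp(i 2 pi k t / T); c lists c_{-n}, ..., c_n."""
    n = (len(c) - 1) // 2

    def P(t):
        return sum(c[k + n] * cmath.exp(2j * math.pi * k * t / T)
                   for k in range(-n, n + 1))
    return P


def period_mean(g, T, samples):
    """Average of g over `samples` equally spaced points of [0, T)."""
    return sum(g(T * s / samples) for s in range(samples)) / samples


def recover_spectrum(P, n, T):
    """c_k = (1/T) * integral of P(t) exp(-i 2 pi k t / T), cf. (5.32)."""
    return [period_mean(lambda t, k=k: P(t) * cmath.exp(-2j * math.pi * k * t / T),
                        T, 2 * n + 1)
            for k in range(-n, n + 1)]


def energy(P, n, T):
    """(1/T) * integral of |P(t)|^2 over a period, cf. (5.33)."""
    return period_mean(lambda t: abs(P(t)) ** 2, T, 2 * n + 1)


c = [1 + 2j, 0.5, -1j]          # c_{-1}, c_0, c_1
T = 3.0
P = trig_poly(c, T)
recovered = recover_spectrum(P, 1, T)
total_energy = energy(P, 1, T)  # expected |1+2i|^2 + 0.25 + 1 = 6.25
```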

5.46 Remark. Let $f \in C^0(\mathbb{R})$ be periodic with period $T$. The energy of $f$ is defined to be

\[
\frac{1}{T} \int_0^T |f(t)|^2\,dt.
\]

Of course the energy of the continuous component of $P$ is $|c_0|^2$, while the energy of the $k$-th harmonic of $P(t)$ is

\[
\frac{1}{T} \int_0^T \Big| c_k e^{i \frac{2\pi}{T} k t} + c_{-k} e^{-i \frac{2\pi}{T} k t} \Big|^2 dt = |c_k|^2 + |c_{-k}|^2.
\]

The energy identity can therefore be restated as: the energy of a trigonometric polynomial is the sum of the energies of its components.

d. Sampling 

Let $P(t) = \sum_{k=-n}^{n} c_k \exp\big(i \frac{2\pi}{T} k t\big) \in \mathcal{P}_{n,T}$ be a complex trigonometric polynomial of order $n$. Trivially $P(t) = R\big(\exp\big(i \frac{2\pi}{T} t\big)\big)$, where $R$ is the rational function

\[
R(z) := \sum_{k=-n}^{n} c_k z^k.
\]

Observing that $R(z) = N(z)/z^n$ where $N$ is a polynomial of degree $2n$, and taking into account the principle of identity of polynomials, we infer the following.

5.47 Proposition. Let $P, Q \in \mathcal{P}_{n,T}$. Suppose that $P$ and $Q$ agree on $2n + 1$ distinct points in $[0, T[$. Then $P(t) = Q(t)$ for all $t \in \mathbb{R}$.

Not only is this true, but there is an interpolation formula that allows us to reconstruct $P(t)$ from its values on a suitable choice of $2n + 1$ points in $[0, T[$. This is an easy version of the sampling theorem of Claude Shannon (1916-2001). In order to show that formula, we introduce Dirichlet's kernel of order $n$ as the trigonometric polynomial in $\mathcal{P}_{n,2\pi}$ defined by

\[
D_n(t) := \sum_{k=-n}^{n} e^{ikt} = 1 + 2 \sum_{k=1}^{n} \cos kt, \qquad t \in \mathbb{R}. \tag{5.35}
\]
5.48 Proposition. We have

(i) $D_n(t)$ is an even function, $D_n(-t) = D_n(t)$,

(ii) $D_n(0) = 2n + 1$ and $D_n(\pi) = (-1)^n$,

(iii) for $t \neq 2k\pi$, $k \in \mathbb{Z}$, we have

\[
D_n(t) = \frac{\sin((n + 1/2)t)}{\sin(t/2)},
\]

(iv) $D_n(t)$ vanishes at $t_j = \frac{2\pi}{2n+1} j$ if $j \in \mathbb{Z}$ and $j$ is not a multiple of $2n + 1$. In particular, if $j \in [-2n, 2n]$, then

\[
D_n(t_j) = \begin{cases} 0 & \text{if } j \neq 0, \\ 2n + 1 & \text{if } j = 0. \end{cases}
\]

Proof. (i), (ii), and (iv) are trivial, (iii) can be proved by induction or even directly. In fact, on account of

\[
\sum_{k=0}^{p} z^k = 1 + z + \dots + z^p = \frac{1 - z^{p+1}}{1 - z}, \qquad z \neq 1,
\]

one computes

\[
\sum_{k=-n}^{n} z^k = z^{-n} \sum_{k=0}^{2n} z^k = \frac{1 - z^{2n+1}}{z^n (1 - z)},
\]

hence

\[
D_n(t) = e^{-int}\, \frac{1 - e^{i(2n+1)t}}{1 - e^{it}} = \frac{\sin((n + 1/2)t)}{\sin(t/2)}. \tag{5.36}
\]

□
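The properties listed in Proposition 5.48 are easy to check numerically. The following sketch compares the defining sum (5.35) with the closed form (5.36) and evaluates the kernel at the points $t_j$; the function names and the choice $n = 4$ are ours.

```python
import math


def dirichlet_sum(n, t):
    """D_n(t) = 1 + 2 * sum_{k=1}^{n} cos(k t), as in (5.35)."""
    return 1 + 2 * sum(math.cos(k * t) for k in range(1, n + 1))


def dirichlet_closed(n, t):
    """Closed form sin((n + 1/2) t) / sin(t / 2), valid for t not in 2 pi Z."""
    return math.sin((n + 0.5) * t) / math.sin(t / 2)


n = 4
gap = abs(dirichlet_sum(n, 0.9) - dirichlet_closed(n, 0.9))
at_zero = dirichlet_sum(n, 0.0)       # 2n + 1
at_pi = dirichlet_sum(n, math.pi)     # (-1)^n
at_nodes = [dirichlet_sum(n, 2 * math.pi * j / (2 * n + 1))
            for j in range(1, 2 * n + 1)]  # all zero up to rounding
```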



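5.49 Theorem. Let $Q(x) \in \mathcal{P}_{n,T}$. Set $P(t) := Q\big(\frac{T}{2\pi} t\big) \in \mathcal{P}_{n,2\pi}$. Then

\[
P(t) = \frac{1}{2n+1} \sum_{j=-n}^{n} P(t_j)\, D_n(t - t_j) \qquad \forall t \in \mathbb{R},
\]

where $t_j := \frac{2\pi}{2n+1} j$, $j \in \{-n, \dots, n\}$.

In other words, we can reconstruct $P(t)$ from the values $P(t_j)$ of $P$ at $t_j$.

Proof. Since the formula is linear with respect to $P$, it is enough to prove it only for the functions $e^{iht}$, $h = -n, \dots, n$, that form a basis for $\mathcal{P}_{n,2\pi}$. From the definitions of $D_n$ and of $\{t_j\}$, we deduce

\[
\sum_{j=-n}^{n} e^{iht_j} D_n(t - t_j)
= \sum_{j=-n}^{n} \sum_{k=-n}^{n} e^{iht_j} e^{ik(t - t_j)}
= \sum_{j=-n}^{n} \sum_{k=-n}^{n} e^{ikt} e^{i(h-k)t_j}
= \sum_{k=-n}^{n} e^{ikt} \sum_{j=-n}^{n} e^{i(h-k)t_j}
= \sum_{k=-n}^{n} e^{ikt} D_n(t_{h-k}).
\]

Since $h - k \in [-2n, 2n]$, (iv) of Proposition 5.48 yields

\[
D_n(t_{h-k}) = (2n+1)\,\delta_{hk},
\]

hence

\[
\sum_{j=-n}^{n} e^{iht_j} D_n(t - t_j) = (2n+1) \sum_{k=-n}^{n} e^{ikt} \delta_{hk} = (2n+1)\, e^{iht}.
\]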
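The interpolation formula of Theorem 5.49 can be tried on a small example. The sketch below samples a real trigonometric polynomial of degree 2 and period $2\pi$ at the five points $t_j = 2\pi j/5$ and rebuilds its value elsewhere; the test polynomial and the function names are arbitrary choices made for illustration.

```python
import math


def dirichlet(n, t):
    """Dirichlet kernel D_n(t) = 1 + 2 * sum_{k=1}^{n} cos(k t)."""
    return 1 + 2 * sum(math.cos(k * t) for k in range(1, n + 1))


def reconstruct(samples, n, t):
    """(1 / (2n+1)) * sum_j P(t_j) D_n(t - t_j), with t_j = 2 pi j / (2n+1)."""
    nodes = [2 * math.pi * j / (2 * n + 1) for j in range(-n, n + 1)]
    return sum(v * dirichlet(n, t - tj) for v, tj in zip(samples, nodes)) / (2 * n + 1)


def P(t):
    """A trigonometric polynomial of degree 2 and period 2 pi."""
    return 2 + math.cos(t) - 3 * math.sin(2 * t)


n = 2
samples = [P(2 * math.pi * j / (2 * n + 1)) for j in range(-n, n + 1)]
errors = [abs(reconstruct(samples, n, t) - P(t)) for t in (0.7, 1.9, -2.4, 5.0)]
```

Only the $2n + 1 = 5$ sampled values enter `reconstruct`, yet the rebuilt values should match $P$ at every $t$ up to rounding.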



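5.50 Example (Euler's formula). Let us prove that

\[
\int_0^{\infty} \frac{\sin t}{t}\,dt = \frac{\pi}{2}. \tag{5.37}
\]

We recall that the integral in (5.37) exists as an improper integral,

\[
\int_0^{\infty} \frac{\sin x}{x}\,dx := \lim_{y \to \infty} \int_0^{y} \frac{\sin x}{x}\,dx;
\]

see, e.g., Example 4.81 of [GM1]. Therefore it suffices to compute the limit of $\int_0^{x_n} \frac{\sin t}{t}\,dt$ as $n \to \infty$ where $\{x_n\}$ is a sequence that diverges to $+\infty$.

From (5.35) and (5.36) we infer

\[
\frac{\sin(2n+1)t}{\sin t} = 1 + 2 \sum_{k=1}^{n} \cos 2kt
\]

and, integrating over $[0, \pi/2]$,

\[
\int_0^{\pi/2} \frac{\sin(2n+1)t}{\sin t}\,dt = \frac{\pi}{2} + 2 \sum_{k=1}^{n} \frac{\sin(2kt)}{2k} \Big|_0^{\pi/2} = \frac{\pi}{2}.
\]

Therefore

\[
\frac{\pi}{2} - \int_0^{(2n+1)\frac{\pi}{2}} \frac{\sin t}{t}\,dt
= \int_0^{\pi/2} \frac{\sin((2n+1)t)}{\sin t}\,dt - \int_0^{\pi/2} \frac{\sin(2n+1)t}{t}\,dt
= \int_0^{\pi/2} \frac{t - \sin t}{t \sin t}\, \sin(2n+1)t\,dt
= \int_0^{\pi/2} f(t) \sin(2n+1)t\,dt \tag{5.38}
\]

where $f(t) = \frac{t - \sin t}{t \sin t}$. Since $f(t)$ is of class $C^1$ in $[0, \pi/2]$, we can integrate by parts the last integral in (5.38) finding

\[
\int_0^{\pi/2} f(t) \sin(2n+1)t\,dt = -f(t)\, \frac{\cos(2n+1)t}{2n+1} \Big|_0^{\pi/2} + \int_0^{\pi/2} f'(t)\, \frac{\cos(2n+1)t}{2n+1}\,dt \to 0
\]

as $n \to \infty$, i.e.,

\[
\int_0^{(2n+1)\frac{\pi}{2}} \frac{\sin t}{t}\,dt \to \frac{\pi}{2}.
\]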
5.4.2 Sums of sinusoidal functions

5.51 Definition. A finite sum of complex sinusoidal functions is a function $f : \mathbb{R} \to \mathbb{C}$ of the form

\[
f(t) = \sum_{j=1}^{n} c_j \exp(i\omega_j t), \qquad t \in \mathbb{R},
\]

where $c_j \in \mathbb{C}$, $\omega_j \in \mathbb{R}$, $\omega_i \neq \omega_j$ for $i \neq j$ and $j = 1, \dots, n$. Each addend is referred to as a component of $f$, $c_j$ is the amplitude of the component and the vector $\hat f := (c_1, \dots, c_n) \in \mathbb{C}^n$ is the spectrum of $f$.

The sum of complex sinusoidal functions identifies again the coefficients of its components. In fact, we have the following.



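5.52 Theorem. Let $f : \mathbb{R} \to \mathbb{C}$ be the finite sum of complex harmonic functions

\[
f(t) = \sum_{j=1}^{n} c_j \exp(i\omega_j t).
\]

Then for any $\omega \in \mathbb{R}$,

\[
\lim_{N \to +\infty} \frac{1}{2N} \int_{-N}^{N} f(t)\, e^{-i\omega t}\,dt = \begin{cases} c_j & \text{if } \omega = \omega_j \text{ for some } j = 1, \dots, n, \\ 0 & \text{otherwise.} \end{cases}
\]

The claim in Theorem 5.52 follows easily from the following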



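5.53 Lemma. For all $\omega \in \mathbb{R}$ we have

\[
\lim_{N \to +\infty} \frac{1}{2N} \int_{-N}^{N} \exp(i\omega t)\,dt = \begin{cases} 0 & \text{if } \omega \neq 0, \\ 1 & \text{if } \omega = 0. \end{cases}
\]

Proof. If $\omega = 0$ the claim is trivial. Suppose $\omega \neq 0$, let $T := 2\pi/|\omega|$ and let $k$ be the largest integer such that $kT \le N$, so that $N - kT < T$. Since $\exp(i\omega t)$ has zero average over one period interval we deduce

\[
\int_{-kT}^{kT} \exp(i\omega t)\,dt = 0,
\]

hence

\[
\Big| \int_{-N}^{N} \exp(i\omega t)\,dt \Big|
= \Big| \int_{-N}^{-kT} \exp(i\omega t)\,dt + \int_{kT}^{N} \exp(i\omega t)\,dt \Big|
\le \int_{-N}^{-kT} |\exp(i\omega t)|\,dt + \int_{kT}^{N} |\exp(i\omega t)|\,dt \le 2T.
\]

Therefore we conclude $\big| \frac{1}{2N} \int_{-N}^{N} \exp(i\omega t)\,dt \big| \le \frac{T}{N} \to 0$ as $N \to +\infty$. □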
Proof of Theorem 5.52. Since

\[
\frac{1}{2N} \int_{-N}^{N} f(t)\, e^{-i\omega t}\,dt = \sum_{j=1}^{n} c_j\, \frac{1}{2N} \int_{-N}^{N} \exp(i(\omega_j - \omega)t)\,dt,
\]

the conclusion follows from Lemma 5.53. □
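Theorem 5.52 lends itself to a quick numerical illustration. In the sketch below the average of each exponential is evaluated in closed form, using $\frac{1}{2N}\int_{-N}^{N} e^{i\nu t}\,dt = \sin(\nu N)/(\nu N)$ for $\nu \neq 0$; the dictionary representation of the spectrum (pulse to amplitude) and the function names are our assumptions.

```python
import math


def averaged_exponential(nu, N):
    """(1/2N) * integral over [-N, N] of exp(i nu t) dt, in closed form."""
    return 1.0 if nu == 0 else math.sin(nu * N) / (nu * N)


def mean_coefficient(spectrum, omega, N):
    """(1/2N) * integral over [-N, N] of f(t) exp(-i omega t) dt,
    for f(t) = sum_j c_j exp(i omega_j t) given as {omega_j: c_j}."""
    return sum(c * averaged_exponential(w - omega, N) for w, c in spectrum.items())


spectrum = {1.0: 2 + 1j, math.sqrt(2): -1, 3.5: 0.5j}
N = 1.0e6
on_pulse = mean_coefficient(spectrum, math.sqrt(2), N)   # close to -1
off_pulse = mean_coefficient(spectrum, 2.0, N)           # close to 0
```

Note that the pulses $1$, $\sqrt2$, $3.5$ have irrational quotients, so this $f$ is not periodic; the averages nevertheless single out each amplitude.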



In particular, Theorem 5.52 yields the following. 



5.54 Corollary. If the finite sum of complex sinusoidal functions is zero in $\mathbb{R}$, then the coefficients of all components are zero.

Another proof of Corollary 5.54. We give here a direct alternative proof based on induction on the number of components. The claim is trivial for $n = 1$. Let us prove it for $n = 2$. Suppose

\[
c_1 \exp(i\omega_1 t) + c_2 \exp(i\omega_2 t) = 0 \quad \forall t, \qquad \omega_1 \neq \omega_2.
\]

Multiplying by $\exp(-i\omega_2 t)$ we find $c_1 \exp(i(\omega_1 - \omega_2)t) + c_2 = 0$ and differentiating

\[
i c_1 (\omega_1 - \omega_2) \exp(i(\omega_1 - \omega_2)t) = 0 \quad \forall t
\]

which yields $c_1 = 0$ since $\omega_2 \neq \omega_1$. Consequently also $c_2 = 0$.

Suppose the theorem holds for the sum of $n - 1$ complex harmonic functions and let

\[
f(t) = \sum_{j=1}^{n} c_j \exp(i\omega_j t) = 0 \quad \forall t.
\]

Multiplying by $\exp(-i\omega_n t)$ and differentiating we find

\[
\sum_{j=1}^{n-1} i c_j (\omega_j - \omega_n) \exp(i(\omega_j - \omega_n)t) = 0;
\]

therefore $c_j(\omega_j - \omega_n) = 0$ for all $j = 1, \dots, n-1$, because of the inductive assumption. Since $\omega_j \neq \omega_n$, we infer $c_j = 0$ for all $j = 1, \dots, n-1$ and consequently also $c_n = 0$. □

Though sinusoidal signals are periodic, finite sums of sinusoidal signals are periodic if and only if the sum is a trigonometric polynomial. In fact, we have the following.

5.55 Theorem. The sum of sinusoidal signals is periodic if and only if the quotients of the pulses of the components are rational.

Proof. Let $f(t) = \sum_{j=1}^{n} c_j \exp(i\omega_j t)$ be the sum of sinusoidal signals with $\omega_i \neq \omega_j$ for $i \neq j$ and $c_j \neq 0$. Write $T_j = 2\pi/\omega_j$ for the period of the $j$-th component.

We may and do assume that $\omega_1 \neq 0$. If $\omega_j = \omega_1 p_j/q_j$ for $j = 1, \dots, n$, then $p_j T_j = q_j T_1$. Each $T_j$ is then a submultiple of $T := q_2 \cdot q_3 \cdots q_n T_1$. Consequently every component is periodic of period $T$, hence $f(t)$ is periodic of period $T$.

Conversely suppose that $f(t)$ is periodic of period $T > 0$. We have

\[
f(t + T) - f(t) = \sum_{j=1}^{n} c_j \big(\exp(i\omega_j T) - 1\big) \exp(i\omega_j t) = 0,
\]

then Corollary 5.54 implies $\exp(i\omega_j T) = 1$. It follows that for every $j = 1, \dots, n$ there exists an integer $k_j$ such that $\omega_j T = 2k_j\pi$, i.e., the component's pulses have rational quotients. □
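The first half of the proof is constructive and can be sketched with exact rational arithmetic. In the snippet below the quotients $\omega_j/\omega_1$ are given as `Fraction` objects; instead of the product $q_2 \cdots q_n$ used in the proof, we take the least common multiple of the denominators, which also yields a common period $T = M\,T_1$ (a variation of ours, not the text's choice). The function name is hypothetical and `math.lcm` requires Python 3.9 or later.

```python
from fractions import Fraction
from math import lcm


def common_period_multiple(ratios):
    """Given ratios[j] = omega_j / omega_1 as Fractions, return an integer M
    such that T = M * T_1 (with T_1 = 2 pi / omega_1) is a period of every component."""
    return lcm(*(r.denominator for r in ratios))


ratios = [Fraction(1), Fraction(3, 2), Fraction(5, 3)]
M = common_period_multiple(ratios)

# omega_j * T / (2 pi) = ratios[j] * M must be an integer for every component
turns = [r * M for r in ratios]
```

With pulses in the ratios $1 : 3/2 : 5/3$ one gets $M = 6$: over the time $6T_1$ the three components complete $6$, $9$ and $10$ full oscillations.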



5.5 Summing Up 



Polynomials 

Let $A, B \in \mathbb{K}[x]$ be two polynomials with coefficients in the field $\mathbb{K}$. $B$ divides $A$ if $A = BQ$. $A$ is said to be irreducible if no polynomial divides $A$ but the constants. Two polynomials $A$ and $B$ are coprime if they do not have a common divisor but the constants. The greatest common divisor of $A$ and $B$, g.c.d. $(A, B)$, is the divisor of $A$ and $B$ of highest degree. All greatest common divisors differ by a multiplicative constant, and one of them can be quickly computed by Euclid's algorithm.

o Division algorithm. Assume that $A$ and $B$ are two polynomials in $\mathbb{K}[x]$ with $\deg A \ge \deg B$; then there exist uniquely defined polynomials $Q$ and $R$ such that $A(x) = B(x)Q(x) + R(x)$ with $\deg R < \deg B$.

o Bezout identity. Given two polynomials $A$ and $B$, there exist polynomials $U, V \in \mathbb{K}[x]$ such that $A(x)U(x) + B(x)V(x) = \text{g.c.d.}(A, B)(x)$.

o Unique factorization theorem. Any polynomial in $\mathbb{K}[x]$ is the product of its irreducible factors. The decomposition is unique apart from the order of the factors.

o Factor theorem. $\alpha \in \mathbb{K}$ is a root of $P \in \mathbb{K}[x]$, i.e., $P(\alpha) = 0$, if and only if $x - \alpha$ divides $P(x)$. Therefore $P(x)$ has at most $n$ distinct roots if $\deg P = n$.

o Principle of identity of polynomials. Two polynomials $P, Q$ with degree at most $n$ must be equal if their polynomial functions coincide on $n + 1$ distinct points in $\mathbb{K}$.

o Fundamental theorem of algebra. A polynomial $P(z) = \sum_{k=0}^{n} a_k z^k$ of degree $n$ in $\mathbb{C}[z]$ factorizes as a product of $n$ polynomials of first degree

\[
P(z) = a_n (z - \alpha_1)(z - \alpha_2) \cdots (z - \alpha_n) \quad \text{in } \mathbb{C}[z].
\]

o Factorization in $\mathbb{R}[x]$. A polynomial with real coefficients of degree $n$ in $\mathbb{R}[x]$ factorizes as a product of first and second order irreducible polynomials in $\mathbb{R}[x]$.

Polynomial equations

There exist algebraic procedures that fully solve in $\mathbb{C}$ the polynomial equations of third and fourth degree, see 5.21 and 5.23. Polynomial equations of degree five or higher cannot be solved by radicals in general. There are however simpler rules to compute the number of real, positive roots of a real polynomial, see Theorem 5.25, and the number of zeros of a real polynomial in a given interval $[a, b]$ of $\mathbb{R}$, see Theorem 5.27.
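Rational functions

Rational functions are quotients of complex polynomials

\[
R(z) := \frac{C(z)}{D(z)}.
\]

They are defined on $\mathbb{C} \setminus \{z \mid D(z) = 0\}$. By the division algorithm, $R(z)$ decomposes as a sum of a polynomial, which is nonzero if and only if $\deg C \ge \deg D$, and of a rational function $S(z) := A(z)/B(z)$ such that $\deg A < \deg B$. We can now decompose $A(z)/B(z)$ in simpler fractions, once the roots of $B$ are known.

o Hermite's decomposition formula in $\mathbb{C}$. We have

\[
\frac{A(z)}{B(z)} = \sum_{\alpha \text{ root of } B}\ \sum_{j=1}^{k(\alpha)} \frac{\lambda_{\alpha,j}}{(z - \alpha)^j} \tag{5.39}
\]

where the $\lambda_{\alpha,j}$ can be computed as follows: let $Q_\alpha(z)$ be such that $B(z) = (z - \alpha)^{k(\alpha)} Q_\alpha(z)$. We then have $Q_\alpha(\alpha) \neq 0$. Set $R_{\alpha,k(\alpha)}(z) := A(z)$, then compute iteratively for $j = k(\alpha), \dots, 2, 1$,

\[
\begin{cases}
\lambda_{\alpha,j} := R_{\alpha,j}(\alpha)/Q_\alpha(\alpha), \\
R_{\alpha,j-1}(z) := \big(R_{\alpha,j}(z) - \lambda_{\alpha,j} Q_\alpha(z)\big)/(z - \alpha).
\end{cases}
\]

o Hermite's decomposition formula in $\mathbb{R}$. Let $A(x)$ and $B(x) \in \mathbb{R}[x]$ with $\deg A < \deg B$. Let $\deg B =: n$ and denote by $k(\alpha)$ the multiplicity of the root $\alpha$ of $B$. Then

\[
\frac{A(x)}{B(x)} = \sum_{\substack{\alpha \text{ root of } B \\ \alpha \in \mathbb{R}}} \sum_{j=1}^{k(\alpha)} \frac{\lambda_{\alpha,j}}{(x - \alpha)^j}
+ \sum_{\substack{\alpha \text{ root of } B \\ \Im(\alpha) > 0}} \sum_{j=1}^{k(\alpha)} 2\,\Re\Big(\frac{\lambda_{\alpha,j}}{(x - \alpha)^j}\Big) \tag{5.40}
\]

for all $x \in \mathbb{R}$ such that $B(x) \neq 0$. The constants $\lambda_{\alpha,j}$ are the same as in (5.39).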



Trigonometric polynomials 

A (complex) trigonometric polynomial is a function of the type

\[
P(t) = \sum_{k=-n}^{n} c_k \exp\Big(i \frac{2\pi}{T} k t\Big), \qquad t \in \mathbb{R},
\]

where $T > 0$ and, for $k = -n, \dots, n$, $c_k \in \mathbb{C}$. The numbers $c_k$ are called the spectrum of $P$ and obviously fix $P$.

o The spectrum can be recovered from $P$ by

\[
c_k = \frac{1}{T} \int_0^T P(t) \exp\Big(-i \frac{2\pi}{T} k t\Big)\,dt, \qquad k = -n, \dots, n,
\]

o the energy equality holds

\[
\frac{1}{T} \int_0^T |P(t)|^2\,dt = \sum_{k=-n}^{n} |c_k|^2.
\]

o Sampling. The trigonometric polynomial $P(t) \in \mathcal{P}_{n,2\pi}$ and its spectrum $\{c_k\}$, $k \in \{-n, \dots, n\}$, can be computed by sampling $P$ at the points $t_k := 2\pi k/(2n+1)$, $k \in \{-n, \dots, n\}$. We have

\[
P(t) = \frac{1}{2n+1} \sum_{j=-n}^{n} P(t_j)\, D_n(t - t_j) \quad \forall t \in \mathbb{R},
\qquad
c_k = \frac{1}{2n+1} \sum_{j=-n}^{n} P(t_j)\, e^{-ikt_j}, \quad k = -n, \dots, n,
\]

where

\[
D_n(t) := \sum_{k=-n}^{n} e^{ikt} = \frac{\sin((n + 1/2)t)}{\sin(t/2)}
\]

is the Dirichlet's kernel of order $n$.


5.6 Exercises 



5.56 ¶. Let $P(z) = a_2 z^2 + a_1 z + a_0$ and let $\alpha_1, \alpha_2$ be its two complex roots. Show that

\[
-a_1/a_2 = \alpha_1 + \alpha_2, \qquad a_0/a_2 = \alpha_1 \alpha_2.
\]

5.57 ¶. Let $P(z) = a_3 z^3 + a_2 z^2 + a_1 z + a_0$ and let $\alpha_1, \alpha_2, \alpha_3$ be its three complex roots. Show that

\[
\alpha_1 + \alpha_2 + \alpha_3 = -a_2/a_3, \qquad \alpha_1\alpha_2 + \alpha_1\alpha_3 + \alpha_2\alpha_3 = a_1/a_3, \qquad \alpha_1\alpha_2\alpha_3 = -a_0/a_3.
\]

5.58 ¶. Suppose that the coefficients of $a_n x^n + a_{n-1} x^{n-1} + \dots + a_1 x + a_0$ are integers and that $a_n = 1$. Suppose that such a polynomial has a real root $x$. Show that either $x$ is an integer or $x$ is irrational. In the case $x$ is an integer, notice that $x$ divides $a_0$.

5.59 ¶. Suppose that the equation $x^3 + px^2 + qx + r = 0$ has three real roots, and let $d$ be the difference between the largest and the smallest root. Show that

\[
\sqrt{p^2 - 3q} \le d \le \frac{2}{\sqrt3} \sqrt{p^2 - 3q}.
\]

5.60 ¶. Let $x_0$ be a root of $x^n + a_{n-1} x^{n-1} + \dots + a_0 = 0$. Show that $|x_0| < 1 + |a_0| + \dots + |a_{n-1}|$. [Hint: Consider separately the case $|x_0| < 1$ and $|x_0| \ge 1$.]

5.61 ¶. Let $P(z) = \sum_{k=0}^{n} a_k z^k$ be a polynomial of degree $n$ and let $\alpha_1, \dots, \alpha_n$ be the $n$ roots. Show that

\[
\sigma_k(\alpha_1, \dots, \alpha_n) = (-1)^k a_{n-k}/a_n, \qquad k = 1, \dots, n,
\]

where $\sigma_k(\alpha_1, \dots, \alpha_n)$ is the sum of all possible products of $k$ roots, called the symmetric $k$-function.


5.62 1. Every real polynomial which is nonnegative for all real x may be written in the 
form P^{x) -\-Q‘^{x) where P and Q are real polynomials. [Hint: Observe -\-q^){r^ + 
5.63 ¶ Lagrange's interpolating polynomials. Given $n+1$ points in $\mathbb{C}$, $x_0, x_1, \dots, x_n$, and $n+1$ values $y_0, y_1, \dots, y_n$ in $\mathbb{C}$, show that a polynomial $P$ of degree $n$ such that

\[
P(x_i) = y_i, \qquad i = 0, 1, \dots, n,
\]

necessarily has the form

\[
P(x) = \sum_{j=0}^{n} L_j^n(x)\, y_j
\]

where the $L_j^n(x)$ are the unique polynomials of degree $n$, called Lagrange interpolating polynomials, such that

\[
L_j^n(x_i) = \delta_{ij}, \qquad i, j = 0, 1, \dots, n,
\]

$\delta_{ij}$ being the Kronecker symbol. Write explicitly the $L_j^n(x)$'s.

5.64 ¶¶ Trigonometric solution of third degree equations. Consider the equation $x^3 + px + q = 0$ where $p, q \in \mathbb{R}$.

(i) If g^/4 + 127 < 0 and p > 0, replace x with rx to obtain 

y^ + ^y+\=0- 



Comparing with the trigonometric identity 



o 3 ^ 1 

cos cos cos = 0, 

3 4 3 4 ^ 



write the roots in terms of trigonometric functions of (p. 

(ii) If q^/A + p^/27 > 0 and p > 0, set 



tan<p := 



p\32 

-) 0Cp<n, 



and find the roots as functions of 

(iii) If + p^/27 > 0 and p < 0, set 

sinv.:= 

and find the roots in function of <p. 



tan^ 



:= ^tan ■ 



0 < (p < 7T 



5.65 f . A hydrostatic approach to solve third order equations was proposed in 1898 by 
A. Demanet. Consider two communicating vessels, one being a circular cone of radius 
r and altitude a, the other a cylinder with basis of 1 square centimeter. If 



r 

a 




and if h is the altitude that is reached by c cubic centimeters of liquid, show that h 
solves the equation 

+ X = c. 




6. Series 



Processes of infinite summation or infinite series, or, simply, series have appeared since ancient times. Aristotle (384BC-322BC) in his Physics seems to be aware that the geometric series

\[
1 + q + q^2 + \cdots
\]

has a finite sum if $|q| < 1$. Later François Viète (1540-1603) in fact computed (in 1593)

\[
1 + q + q^2 + \cdots = \frac{1}{1-q}.
\]

Zeno's paradox of dichotomy clearly concerns the decomposition of 1 into the infinite series

\[
1 = \frac12 + \frac14 + \frac18 + \cdots.
\]

In medieval times, Nicole d'Oresme (1323-1382) showed that the harmonic series

\[
1 + \frac12 + \frac13 + \frac14 + \cdots
\]

diverged. However, it was in the seventeenth century with the Calculus of Sir Isaac Newton (1643-1727) and Gottfried von Leibniz (1646-1716) that infinite series pervaded mathematics tremendously, especially as power series.

For Gottfried von Leibniz (1646-1716) and Sir Isaac Newton (1643-1727) and their contemporaries such as John Wallis (1616-1703), James Gregory (1638-1675), Brook Taylor (1685-1731), James Stirling (1692-1770), and Colin MacLaurin (1698-1746), functions were essentially infinite polynomials or power series, and with them one operated to differentiate and to integrate or compute areas, to calculate special quantities such as $e$ and $\pi$ and the logarithmic and trigonometric functions, to interpolate series of data (which was particularly useful for navigation). For example, Leibniz found

\[
\frac{\pi}{4} = 1 - \frac13 + \frac15 - \frac17 + \cdots,
\]

the right-hand side of which is now called a Leibniz series, and Newton, in De analysi per equationes numero terminorum infinitas, found the series of $\log(1+x)$, $\sin x$, $\cos x$, $\arcsin x$, $e^x$, ...: as, for instance,
Figure 6.1. Frontispieces of Introductio . . . and of Institutiones calculi ... by Leonhard Euler (1707-1783).



\[
\log(1+x) = x - \frac12 x^2 + \frac13 x^3 - \cdots
\]

from which

\[
\log 2 = 1 - \frac12 + \frac13 - \cdots.
\]

James Gregory (1638-1675) found

\[
\arctan x = x - \frac{x^3}{3} + \frac{x^5}{5} - \cdots,
\]

nowadays called a Gregory series, the special case of which with $x = 1$ is a Leibniz series.

In this chapter we illustrate methods for the study of the convergence of numerical series, while in the next we shall deal with basic facts concerning power series.



6.1 Basic Facts 

Given a sequence $\{a_n\}$, $n = 0, 1, 2, \dots$, of real or complex numbers, the recurrence

\[
\begin{cases}
s_0 = a_0, \\
s_{n+1} = s_n + a_{n+1}, \qquad \forall n \ge 0
\end{cases} \tag{6.1}
\]
Figure 6.2. The first page of the paper by Carl Friedrich Gauss (1777-1855) and the frontispiece of G. F. Bernhard Riemann's (1826-1866) paper on hypergeometric series.



defines by induction a unique sequence {^n}, 

n 

Sn := Uq a\ U 2 CLn — ^ ^ Cij 5 

j=0 

see Example 2.5. The sequence {sn} is called the sequence of partial sums 
of {an} or just the series of {an}- We use the symbol 

00 

j=o 

in order to indicate the sequence {sn}. The presumption let us consider 
the series ^3 ’ therefore a shorthand for let us consider the sequence 

of partial sums {Yl^=o^j} sequence {an}- 

a. Definitions and examples

Let $\sum_{n=0}^\infty a_n$, $a_n \in \mathbb{R}$, be a series of real numbers, and let $\{s_n\}$,
$s_n := \sum_{j=0}^n a_j$, be the sequence of partial sums.

6.1 Definition. If $\{s_n\}$ converges to $L$, we say that the series converges
and that $L$ is the sum of the series.

Actually, three alternative situations are possible:

(i) $\lim_{n\to\infty} \sum_{j=0}^n a_j$ does not exist; we then say that the series is
indeterminate. This is the case of the series $\sum_{n=0}^\infty (-1)^n$.

(ii) $\lim_{n\to\infty} \sum_{j=0}^n a_j = L \in \mathbb{R}$; the series converges to $L$ and we write

$$\sum_{n=0}^{\infty} a_n = L \quad\text{or}\quad \sum_{j=0}^{n} a_j \to L, \ n \to \infty.$$

(iii) $\lim_{n\to\infty} \sum_{j=0}^n a_j = +\infty$ (resp. $-\infty$). In this case we say that the
series diverges to $+\infty$ (resp. $-\infty$) and we write

$$\sum_{n=0}^{\infty} a_n = +\infty \quad \Big(\text{resp. } \sum_{n=0}^{\infty} a_n = -\infty\Big).$$



6.2 ¶. Show that $\sum_{j=0}^\infty j$ and $\sum_{j=0}^\infty j^2$ diverge to $+\infty$.

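6.3 Example (Geometric series). We saw in Example 2.65 that

$$G_n := \sum_{j=0}^{n} q^j = \begin{cases} n + 1 & \text{if } q = 1, \\[2pt] \dfrac{1 - q^{n+1}}{1 - q} & \text{otherwise,} \end{cases}$$

consequently, for real $q$, the geometric series

$$\sum_{j=0}^{\infty} q^j \quad \begin{cases} \text{converges to } \dfrac{1}{1-q} & \text{if } |q| < 1, \\[2pt] \text{diverges to } +\infty & \text{if } q \ge 1, \\ \text{is indeterminate} & \text{if } q \le -1. \end{cases}$$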
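The closed form for the geometric partial sums is easy to check by machine. The following is a minimal sketch in Python, using exact rational arithmetic; the function names `geometric_partial_sum` and `geometric_closed_form` are illustrative and not taken from the text.

```python
from fractions import Fraction

def geometric_partial_sum(q, n):
    """Return G_n = q^0 + q^1 + ... + q^n by direct summation."""
    total, power = 0, 1
    for _ in range(n + 1):
        total += power
        power *= q
    return total

def geometric_closed_form(q, n):
    """Closed form of G_n: n + 1 if q == 1, else (1 - q^(n+1)) / (1 - q)."""
    if q == 1:
        return n + 1
    return (1 - q ** (n + 1)) / (1 - q)

q = Fraction(1, 2)
for n in (0, 1, 5, 20):
    assert geometric_partial_sum(q, n) == geometric_closed_form(q, n)

# For |q| < 1 the partial sums approach 1 / (1 - q); here the limit is 2.
gap = 1 / (1 - q) - geometric_partial_sum(q, 20)
print(float(gap))
```

With $q = -1$ the same function returns alternately 1 and 0, which illustrates the indeterminate case.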



6.4 Example (Telescoping series). These are series for which the general term $a_n$
can be expressed in the form

$$a_n = b_n - b_{n-1}$$

for a suitable sequence $\{b_n\}$. In this case

$$\sum_{j=2}^{n} a_j = (b_n - b_{n-1}) + (b_{n-1} - b_{n-2}) + \cdots + (b_3 - b_2) + (b_2 - b_1) = b_n - b_1.$$

An example is given by Mengoli's series $\sum_{j=1}^\infty \frac{1}{j(j+1)}$, named after Pietro Mengoli
(1626-1686). In fact

$$\frac{1}{j(j+1)} = \frac{1}{j} - \frac{1}{j+1} \quad \forall j \ge 1,$$

hence

$$\sum_{j=1}^{n} \frac{1}{j(j+1)} = 1 - \frac{1}{n+1} \quad \text{for all } n \ge 1.$$

We therefore get

$$\sum_{j=1}^{\infty} \frac{1}{j(j+1)} = 1,$$

see an illustration in Figure 6.3.

Actually, the telescoping trick allows us to regard every sequence as the partial sum
of a series. In fact, if $\{a_n\}_{n \ge 1}$ is given and we set $a_0 = 0$, we have

$$\sum_{j=1}^{n} (a_j - a_{j-1}) = a_n - a_0 = a_n \quad \forall n \ge 1,$$

consequently we see that the series $\sum_{j=1}^\infty (a_j - a_{j-1})$ converges, diverges or is
indeterminate according to whether $\{a_n\}$ converges, diverges or has no limit. In the first two
cases

$$\sum_{j=1}^{\infty} (a_j - a_{j-1}) = \lim_{n \to \infty} a_n.$$



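6.5 Example (Arithmetic-geometric series). There is a closed formula also for

$$S(n) := \sum_{j=1}^{n} j q^j, \quad n \ge 1, \ q \in \mathbb{C}, \ q \ne 1.$$

In fact, multiplying $S(n)$ by $(1-q)^2$ we get

$$(1-q)^2 S(n) = \sum_{j=1}^{n} j q^j - 2 \sum_{j=1}^{n} j q^{j+1} + \sum_{j=1}^{n} j q^{j+2}$$

$$= q + \sum_{j=2}^{n} \big(j - 2(j-1) + (j-2)\big) q^j - 2n q^{n+1} + (n-1) q^{n+1} + n q^{n+2},$$

that yields, since every coefficient in the middle sum vanishes,

$$\sum_{j=1}^{n} j q^j = \frac{q - (n+1) q^{n+1} + n q^{n+2}}{(1-q)^2}.$$

Since $n q^n \to 0$ as $n \to \infty$ if and only if $|q| < 1$, see Example 2.59, $\sum_{j=0}^\infty j q^j$ converges
if and only if $|q| < 1$ and in this case

$$\sum_{j=0}^{\infty} j q^j = \frac{q}{(1-q)^2}.$$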
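As a sanity check of the closed formula for $\sum_{j=1}^n j q^j$, one can compare it with direct summation in exact arithmetic. This is only an illustrative sketch; the helper names `arith_geom_sum` and `arith_geom_closed` are assumptions introduced here.

```python
from fractions import Fraction

def arith_geom_sum(q, n):
    """Direct evaluation of S(n) = sum_{j=1}^n j * q^j."""
    return sum(j * q ** j for j in range(1, n + 1))

def arith_geom_closed(q, n):
    """Closed form (q - (n+1) q^(n+1) + n q^(n+2)) / (1 - q)^2, valid for q != 1."""
    return (q - (n + 1) * q ** (n + 1) + n * q ** (n + 2)) / (1 - q) ** 2

# The identity holds for every q != 1, whether or not the series converges.
for q in (Fraction(1, 3), Fraction(-2, 5), Fraction(3, 2)):
    for n in (1, 2, 7, 15):
        assert arith_geom_sum(q, n) == arith_geom_closed(q, n)

q = Fraction(1, 3)
limit = q / (1 - q) ** 2  # value of the infinite series when |q| < 1
print(limit, float(limit - arith_geom_sum(q, 30)))
```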

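6.6 Infinite product. We can define the infinite product of a sequence
of positive numbers $\{a_i\}$,

$$\prod_{i=1}^{\infty} a_i := \lim_{n \to \infty} \prod_{i=1}^{n} a_i,$$

when it exists. Trivially, $\prod_{i=1}^\infty a_i$ exists, finite and nonzero, if and only if the series
$\sum_{i=1}^\infty \log a_i$ converges and, in this case,

$$\prod_{i=1}^{\infty} a_i = \exp\Big(\sum_{i=1}^{\infty} \log a_i\Big).$$

b. A necessary condition for convergence

6.7 Proposition. If $\sum_{j=0}^\infty a_j$ converges, then $a_n \to 0$.

Proof. In fact

$$a_n = \sum_{j=0}^{n} a_j - \sum_{j=0}^{n-1} a_j \to L - L = 0$$

if $L$ is the sum of $\sum_{j=0}^\infty a_j$. □

However, the condition $a_n \to 0$ is not sufficient to ensure convergence
of $\sum_{j=0}^\infty a_j$. For instance, we shall see in Example 6.27 that the harmonic
series $\sum_{n=1}^\infty \frac{1}{n}$ diverges.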

c. Series and improper integrals 

The concept of sum of a series can be seen as a particular case of an
improper integral (see Section 4.5.2 of [GM1]), and this is quite a useful
remark, see, for instance, Example 6.25. To a sequence $\{a_n\}$, $n \ge 0$, of real
numbers, we associate the piecewise constant function $\varphi_a : [0, +\infty[ \to \mathbb{R}$
defined by

$$\varphi_a(x) = a_n \quad \text{if } n \le x < n + 1.$$

Clearly $\varphi_a$ is measurable and, for all $n \ge 0$, we have

$$\sum_{j=0}^{n} a_j = \int_0^{n+1} \varphi_a(x)\,dx. \tag{6.2}$$

Actually

6.8 Proposition. The sequence of partial sums of $\{a_n\}$ has a limit in $\overline{\mathbb{R}}$
if and only if $\int_0^x \varphi_a(t)\,dt$ has a limit when $x \to +\infty$. In this case

$$\sum_{j=0}^{\infty} a_j = \lim_{x \to \infty} \int_0^x \varphi_a(t)\,dt.$$

In particular $\sum_{j=0}^\infty a_j$ converges if and only if $\varphi_a$ has an improper integral
at infinity.

6.9 ¶. Prove Proposition 6.8. [Hint: Show that $\lim_{x \to +\infty} \int_0^x \varphi_a(t)\,dt$ exists if and only
if $\lim_{n \to \infty} \int_0^n \varphi_a(t)\,dt$ exists and

$$\lim_{x \to +\infty} \int_0^x \varphi_a(t)\,dt = \lim_{n \to \infty} \int_0^n \varphi_a(t)\,dt.$$

For that, set $\phi(x) := \int_0^x \varphi_a(t)\,dt$ and observe that for $n := [x]$,

$$\begin{cases} \phi(n) \le \phi(x) \le \phi(n+1) & \text{if } a_n \ge 0, \\ \phi(n+1) \le \phi(x) \le \phi(n) & \text{if } a_n \le 0.] \end{cases} \tag{6.3}$$

Figure 6.4. Frontispieces of two books respectively by Emile Borel (1871-1956) and
G. H. Hardy (1877-1947) on divergent series.



d. Decimals 

Every nonnegative real number $x$ has a decimal representation $x = q_0.q_1 q_2 q_3 \dots$, which
is defined iteratively by

$$x_0 := x, \quad q_0 := [x_0], \qquad \begin{cases} x_{j+1} := x_j - \dfrac{q_j}{10^j}, \\[4pt] q_{j+1} := [10^{j+1} x_{j+1}], \end{cases}$$

where $[a]$ is the largest integer not greater than $a$. Inductively we also see
that $q_j \in \{0, \dots, 9\}$ for $j > 0$ and that $0 \le x_j < 10^{-j+1}$ for $j \ge 1$. In particular
$x_j \to 0$ as $j \to \infty$. Moreover, since $q_j/10^j = x_j - x_{j+1}$,

$$\sum_{j=0}^{N} \frac{q_j}{10^j} = q_0 + \sum_{j=1}^{N} (x_j - x_{j+1}) = q_0 + x_1 - x_{N+1} = x - x_{N+1},$$

hence, when $N \to \infty$,

$$x = \sum_{j=0}^{\infty} \frac{q_j}{10^j} = q_0 + \frac{q_1}{10} + \frac{q_2}{100} + \frac{q_3}{1000} + \cdots,$$

or, as we commonly write, $x = q_0.q_1 q_2 q_3 \dots$.

The algorithm giving the decimal alignment may stop after $N$ steps,
i.e., $x_j = q_j = 0$ $\forall j > N$. This clearly happens if and only if $x - q_0 = k/10^N$ with
$k \in \{0, \dots, 10^N - 1\}$. However, even for rational numbers the decimal
alignment may be infinite as for $1/3 = 0.333333\dots$.

Conversely, every series $\sum_{j=0}^\infty q_j/10^j$ with $q_j \in \{0, 1, 2, \dots, 9\}$ for $j \ge 1$, i.e., every
decimal alignment $q_0.q_1 q_2 \dots$, is (converges to) a real number. In fact,
the series converges because the sequence of partial sums is increasing and

$$\sum_{j=1}^{\infty} \frac{q_j}{10^j} \le \sum_{j=1}^{\infty} \frac{9}{10^j} = \frac{9}{10} \cdot \frac{1}{1 - 1/10} = 1$$

with equality if and only if $q_j = 9$ $\forall j \ge 1$. The same reasoning actually
yields that, for any $N \ge 0$, $\sum_{j=N+1}^\infty q_j/10^j \le 10^{-N}$ and

$$\sum_{j=N+1}^{\infty} \frac{q_j}{10^j} = 10^{-N} \quad \text{if and only if } q_j = 9 \ \ \forall j \ge N + 1. \tag{6.4}$$
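The iterative definition of the digits $q_j$ translates directly into a short program. The sketch below assumes a nonnegative rational input and uses exact fractions so that no rounding interferes; the function name `decimal_digits` is illustrative.

```python
from fractions import Fraction
from math import floor

def decimal_digits(x, count):
    """Return [q_0, q_1, ..., q_count] for a nonnegative rational x.

    Follows the recurrence x_0 = x, q_0 = [x_0],
    x_{j+1} = x_j - q_j / 10^j, q_{j+1} = [10^(j+1) * x_{j+1}].
    """
    digits = []
    remainder = Fraction(x)          # plays the role of x_j
    q = floor(remainder)             # plays the role of q_j
    for j in range(count + 1):
        digits.append(q)
        remainder -= Fraction(q, 10 ** j)        # x_{j+1}
        q = floor(10 ** (j + 1) * remainder)     # q_{j+1}
    return digits

print(decimal_digits(Fraction(1, 3), 6))
print(decimal_digits(Fraction(2347658, 10 ** 7), 8))
```

For $1/3$ the digits never terminate, while for $2347658/10^7$ they are eventually zero; the algorithm never produces an alignment ending in an infinite string of 9s.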

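We conclude proving

6.10 Proposition. Two decimal alignments $q_0.q_1 q_2 \dots$ and $h_0.h_1 h_2 \dots$
have different sums (represent different numbers) except when there exists
an integer $N$ such that $q_j = h_j$ for all $j < N$ and

$$\text{either} \quad \begin{cases} q_N = 1 + h_N, \\ q_j = 0, \ h_j = 9 \ \ \forall j > N, \end{cases} \qquad \text{or} \quad \begin{cases} h_N = q_N + 1, \\ q_j = 9, \ h_j = 0 \ \ \forall j > N. \end{cases}$$

Proof. Suppose that $q_0 + \sum_{j=1}^\infty q_j/10^j = h_0 + \sum_{j=1}^\infty h_j/10^j$, i.e.,

$$q_0 - h_0 + \sum_{j=1}^{\infty} \frac{q_j - h_j}{10^j} = 0.$$

If $q_j = h_j$ does not hold $\forall j$, denote by $N$ the first index for which $q_N \ne h_N$. Then

$$\frac{q_N - h_N}{10^N} = - \sum_{j=N+1}^{\infty} \frac{q_j - h_j}{10^j} =: -R_N$$

and $|q_N - h_N| = 1$, since $q_N - h_N$ is a nonzero integer and $|R_N| \le 10^{-N}$. If for instance $h_N = 1 + q_N$, then

$$R_N = \sum_{j=N+1}^{\infty} \frac{q_j - h_j}{10^j} = 10^{-N}.$$

(6.4) then yields $q_j - h_j = 9$, that is, $q_j = 9$ and $h_j = 0$ for all $j > N$. □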



6.11 Example. 0.234765799999 . . . and 0.2347658 are the same rational number. 




Figure 6.5. Two popular books on $e$ and $\pi$.



6.2 Taylor Series, e and π

A simple and natural way of generating convergent power series is by
starting with a function $f : \,]-a, a[\, \to \mathbb{R}$ of class $C^\infty$ and considering its
Taylor's series

$$\sum_{j=0}^{\infty} \frac{D^j f(0)}{j!} x^j, \quad x \in \,]-a, a[,$$

which has as partial sums Taylor's polynomials

$$P_n(x) := \sum_{j=0}^{n} \frac{D^j f(0)}{j!} x^j.$$

Obviously

$$\sum_{j=0}^{\infty} \frac{D^j f(0)}{j!} x^j = f(x)$$

if and only if the remainder $R_n(x) := f(x) - P_n(x)$ tends to zero as $n \to \infty$.
This happens to be true for quite a number of elementary functions, as we
shall see in the following examples. However, in general

$$f(x) \ne \sum_{n=0}^{\infty} \frac{D^n f(0)}{n!} x^n \quad \forall x \ne 0,$$

as shown for instance by the following

6.12 Example. It is easily seen by induction that the function

$$f(x) := \begin{cases} \exp(-1/x^2) & \text{if } x \ne 0, \\ 0 & \text{if } x = 0 \end{cases}$$

has derivatives of any order and $D^j f(0) = 0$ $\forall j$. Its Taylor series sums to zero while
$f(x) \ne 0$ for all $x \ne 0$.

Figure 6.6. The functions $1/(1+x)$ and $\log(1+x)$ and their respective Taylor polynomials
of order $n$.



Some of the following examples have already been discussed, see e.g.
Section 5.1 of [GM1], but, for the reader's convenience, we repeat them
here.

6.13 Example (Logarithm). Replacing $x$ by $-x$ in the well-known identity

$$\frac{1}{1-x} = \sum_{k=0}^{n} x^k + \frac{x^{n+1}}{1-x}$$

we get

$$\frac{1}{1+x} = \sum_{k=0}^{n} (-1)^k x^k + \frac{(-x)^{n+1}}{1+x}, \tag{6.5}$$

and integrating between 0 and $x$,

$$\log(1+x) = \int_0^x \frac{1}{1+t}\,dt = \sum_{k=0}^{n} (-1)^k \frac{x^{k+1}}{k+1} + R_n(x)$$

where

$$R_n(x) := \int_0^x \frac{(-t)^{n+1}}{1+t}\,dt.$$

The remainder can be easily estimated for $x > -1$ by

$$|R_n(x)| \le \max\Big(1, \frac{1}{1+x}\Big) \frac{|x|^{n+2}}{n+2}, \tag{6.6}$$

hence it converges to zero if $-1 < x \le 1$. We then conclude that the Taylor series of
$\log(1+x)$ converges if $-1 < x \le 1$ and

$$\log(1+x) = \sum_{k=0}^{\infty} (-1)^k \frac{x^{k+1}}{k+1}, \quad -1 < x \le 1. \tag{6.7}$$

Figure 6.7. The functions $\sin x$ and $\cos x$ and their respective Taylor polynomials of
order $n$.
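The remainder estimate for the logarithm can be tested numerically. The sketch below assumes the indexing in which the partial sum runs over $k = 0, \dots, n$ and the remainder is bounded by $\max(1, 1/(1+x))\,|x|^{n+2}/(n+2)$; the function names are illustrative.

```python
import math

def log1p_partial(x, n):
    """Taylor partial sum sum_{k=0}^n (-1)^k x^(k+1) / (k+1) of log(1 + x)."""
    return sum((-1) ** k * x ** (k + 1) / (k + 1) for k in range(n + 1))

def remainder_bound(x, n):
    """Bound max(1, 1/(1+x)) * |x|^(n+2) / (n+2) on |R_n(x)|, for x > -1."""
    return max(1.0, 1.0 / (1.0 + x)) * abs(x) ** (n + 2) / (n + 2)

for x in (-0.5, 0.3, 1.0):
    for n in (1, 5, 20):
        error = abs(math.log1p(x) - log1p_partial(x, n))
        # The tiny slack absorbs floating-point rounding only.
        assert error <= remainder_bound(x, n) + 1e-15
        print(f"x={x:5.2f} n={n:2d} error={error:.2e} bound={remainder_bound(x, n):.2e}")
```

At $x = 1$ the bound decays only like $1/n$, while for $|x| < 1$ it decays geometrically, which matches the behavior described for the series of $\log 2$.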



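6.14 Example (The arc tangent function). Starting from (6.5), replacing $x$ by
$t^2$ and integrating we get

$$\arctan x = \int_0^x \frac{1}{1+t^2}\,dt = \sum_{k=0}^{n} (-1)^k \int_0^x t^{2k}\,dt + R_n(x) = \sum_{k=0}^{n} (-1)^k \frac{x^{2k+1}}{2k+1} + R_n(x)$$

where

$$R_n(x) := \int_0^x \frac{(-t^2)^{n+1}}{1+t^2}\,dt.$$

Since

$$|R_n(x)| \le \int_0^{|x|} t^{2n+2}\,dt = \frac{|x|^{2n+3}}{2n+3}, \tag{6.8}$$

we infer that $R_n(x) \to 0$ as $n \to \infty$ if $|x| \le 1$, concluding that

$$\arctan x = \sum_{n=0}^{\infty} (-1)^n \frac{x^{2n+1}}{2n+1}, \quad |x| \le 1. \tag{6.9}$$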



6.15 Example (Taylor series for $e^x$, $\sin x$, $\cos x$). We know that Taylor's polynomial
of degree $n$ of $e^x$ centered at 0 is

$$P_n(x) = \sum_{k=0}^{n} \frac{x^k}{k!}$$

and Taylor's formula with Lagrange remainder, see e.g., 5.5 of [GM1], yields

$$e^x - \sum_{k=0}^{n} \frac{x^k}{k!} = \frac{e^{\xi}}{(n+1)!} x^{n+1}$$

for a suitable $\xi$ in the interval $]0, x[$ (or $]x, 0[$ if $x < 0$). Therefore, for any $x \in \mathbb{R}$,

$$\Big| e^x - \sum_{k=0}^{n} \frac{x^k}{k!} \Big| \le \max(e^x, 1) \frac{|x|^{n+1}}{(n+1)!} \to 0 \quad \text{as } n \to \infty, \tag{6.10}$$

thus concluding that the series $\sum_{n=0}^\infty x^n/n!$ converges and

$$e^x = \sum_{n=0}^{\infty} \frac{x^n}{n!} \quad \forall x \in \mathbb{R}. \tag{6.11}$$

Similarly, see Figure 6.7, one proves that

$$\sum_{n=0}^{\infty} (-1)^n \frac{x^{2n+1}}{(2n+1)!} = \sin x, \qquad \sum_{n=0}^{\infty} (-1)^n \frac{x^{2n}}{(2n)!} = \cos x, \qquad \forall x \in \mathbb{R}. \tag{6.12}$$

Figure 6.8. The functions $\arctan x$ and $\log\frac{1+x}{1-x}$ and their respective Taylor polynomials
of order $n$.

6.16 ¶. Prove (6.12).



a. The number π

Approximated values of $\pi$ have been known since ancient times. The first
twenty digits are

$$\pi = 3.14159265358979323846\dots.$$

The first analytic representation of $\pi$ was probably found by François Viète
(1540-1603) in 1579 as the infinite product

$$\frac{2}{\pi} = \sqrt{\frac{1}{2}} \, \sqrt{\frac{1}{2} + \frac{1}{2}\sqrt{\frac{1}{2}}} \, \sqrt{\frac{1}{2} + \frac{1}{2}\sqrt{\frac{1}{2} + \frac{1}{2}\sqrt{\frac{1}{2}}}} \cdots,$$

see (6.26); and John Wallis (1616-1703) in 1655 found

$$\frac{\pi}{2} = \frac{2 \cdot 2}{1 \cdot 3} \cdot \frac{4 \cdot 4}{3 \cdot 5} \cdot \frac{6 \cdot 6}{5 \cdot 7} \cdots \frac{2n \cdot 2n}{(2n-1)(2n+1)} \cdots,$$

see (6.29) below. In 1671 James Gregory (1638-1675) found the representation

$$\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \cdots,$$

independently found also by Gottfried von Leibniz (1646-1716) in 1674.
Computing the Taylor series of $\arcsin x$,

$$\arcsin x = \sum_{n=0}^{\infty} \frac{1 \cdot 3 \cdots (2n-1)}{2 \cdot 4 \cdots 2n} \, \frac{x^{2n+1}}{2n+1}, \quad |x| < 1,$$

for $x = 1/2$, in 1665 Sir Isaac Newton (1643-1727) found

$$\frac{\pi}{6} = \frac{1}{2} + \frac{1}{2} \cdot \frac{1}{3} \cdot \frac{1}{8} + \frac{1 \cdot 3}{2 \cdot 4} \cdot \frac{1}{5} \cdot \frac{1}{32} + \frac{1 \cdot 3 \cdot 5}{2 \cdot 4 \cdot 6} \cdot \frac{1}{7} \cdot \frac{1}{128} + \cdots.$$

The number $\pi$ is the area of the unit circle, that is, according to calculus,

$$\frac{\pi}{2} = \int_{-1}^{1} \sqrt{1 - x^2}\,dx$$

(see [GM1]), or the length of the halfcircle, that, as one can prove, is given
by

$$\pi = \int_{-1}^{1} \frac{1}{\sqrt{1 - x^2}}\,dx.$$

Figure 6.9. The values of $\pi$ computed or estimated before the infinitesimal calculus.

    Book of Kings                                          $\pi \approx 3$
    Arithmetic book by Ahmes (1900 BC)                     $\pi \approx (16/9)^2 \approx 3.16$
    Salbasutras (500 BC)                                   $\pi \approx (26/15)^2 \approx 3.0044$
    Plato (5th century BC)                                 $\pi \approx \sqrt{2} + \sqrt{3} \approx 3.14626$
    Archimedes (3rd century BC), estimates from above
      and below by inscribed and circumscribed
      polygons of 96 sides                                 $3 + \frac{10}{71} < \pi < 3 + \frac{1}{7}$
    Zhang Heng (1st century AD)                            $\pi \approx \sqrt{10} \approx 3.162$
    Ptolemy (c. 150 AD), refining the Archimedes
      approximation                                        $\pi \approx 3 + \frac{17}{120} \approx 3.14166$
    Wang Fang (2nd century AD)                             $\pi \approx \frac{142}{45} \approx 3.155$
    Liu Hui (c. 263 AD), polygons of 192 sides             $3.14 + \frac{64}{62500} < \pi < 3.14 + \frac{169}{62500}$
    Liu Hui (c. 263 AD), polygons of 3072 sides            $\pi \approx 3.14159$
    Zhu Chong-Zhi (430-501), $\pi$ up to six digits with
      a convergent of the continued fraction expansion     $\pi \approx \frac{355}{113} \approx 3.1415929$
    Aryabhata (498 AD) and al-Hwarizmi (9th century AD)    $\pi \approx \frac{62832}{20000} = 3.1416$
    Leonardo Pisano (1170-1250), called Fibonacci,
      polygons of 96 sides                                 $\pi \approx \frac{864}{275} \approx 3.141818$
    Albrecht Dürer (1471-1528)                             $\pi \approx 3 + \frac{1}{8}$
    Ludolph Van Ceulen (1540-1610), $\pi$ up to 35 digits  $\pi \approx 3.14159\dots$

Figure 6.10. The partial sums $S_n := \sum_{j=0}^n a_j$ and the error $R_n := \pi - S_n$: (a) for
$a_j := 4(-1)^j/(2j+1)$; (b) for $a_j := 2\sqrt{3}\,(-1)^j/(3^j(2j+1))$.

    (a)     n      S_n                  R_n
            1      2.666666666666667     5E-01
            3      2.895238095238096     2E-01
           10      3.232315809405594    -9E-02
           30      3.173842337190750    -3E-02
          100      3.151493401070991    -1E-02
          300      3.144914903558853    -3E-03
         1000      3.142591654339544    -1E-03
         3000      3.141925875839790    -3E-04
        10000      3.141692643590535    -1E-04
        $\pi$      3.141592653589793

    (b)     n      S_n                  R_n
            1      3.079201435678004     6E-02
            2      3.156181471569954    -1E-02
            3      3.137852891595680     4E-03
            5      3.141308785462883     3E-04
            7      3.141568715941784     2E-05
            9      3.141590510938080     2E-06
           11      3.141592454287646     2E-07
           13      3.141592634547314     2E-08
           15      3.141592651733998     2E-09
        $\pi$      3.141592653589793



6.17 A few series that sum to π. From (6.9) and (6.8) we find the
Leibniz-Gregory result

$$\frac{\pi}{4} = \arctan 1 = \sum_{n=0}^{\infty} (-1)^n \frac{1}{2n+1} \tag{6.13}$$

with the estimate for the rate of decay of the remainder

$$\Big| \sum_{k=n+1}^{\infty} \frac{(-1)^k}{2k+1} \Big| = |R_n(1)| \le \frac{1}{2n+3}.$$

However, the estimated rate of convergence is not fast, in accordance with
the computed values in Figure 6.10 (a).

A better approximation of $\pi$ than (6.13) can be obtained in several
ways. For instance, observing that $\frac{\pi}{6} = \arctan\frac{1}{\sqrt{3}}$, we get again from (6.9)

$$\frac{\pi}{6} = \sum_{n=0}^{\infty} (-1)^n \frac{1}{\sqrt{3}^{\,2n+1}(2n+1)} = \frac{1}{\sqrt{3}} \sum_{n=0}^{\infty} (-1)^n \frac{1}{3^n (2n+1)},$$

i.e.,

$$\pi = 2\sqrt{3} \sum_{n=0}^{\infty} (-1)^n \frac{1}{3^n (2n+1)}. \tag{6.14}$$

If $\sqrt{3}$ is known, then (6.14) is a far better representation than (6.13), since
(6.8) yields an exponential decay for the error,

$$\Big| \sum_{k=n+1}^{\infty} (-1)^k \frac{1}{\sqrt{3}^{\,2k+1}(2k+1)} \Big| \le \frac{1}{\sqrt{3}^{\,2n+3}(2n+3)} = \frac{1}{\sqrt{3}\,(2n+3)\,3^{n+1}},$$

see Figure 6.10 (b).

An even better approximation can be obtained using a simple trick.
Recall that

$$\tan(\alpha + \beta) = \frac{\tan\alpha + \tan\beta}{1 - \tan\alpha \tan\beta}.$$

Starting with $\alpha := \arctan(1/5)$, we then get

$$\tan 2\alpha = \frac{5}{12}, \qquad \tan 4\alpha = \frac{120}{119} = 1 + \frac{1}{119},$$

so that $4\alpha > \pi/4$. Hence

$$\tan(4\alpha - \pi/4) = \frac{\tan 4\alpha - 1}{1 + \tan 4\alpha} = \frac{1}{239},$$

and, if we take into account that $0 < 4\alpha - \pi/4 < \pi/2$,

$$\frac{\pi}{4} = 4\alpha - (4\alpha - \pi/4) = 4 \arctan\frac{1}{5} - \arctan\frac{1}{239},$$

and conclude by (6.9) and (6.8)

$$\frac{\pi}{4} = \sum_{n=0}^{\infty} \frac{(-1)^n}{2n+1} \Big( \frac{4}{5^{2n+1}} - \frac{1}{239^{2n+1}} \Big) \tag{6.15}$$

with an exponential decay estimate for the remainder, see Figure 6.11.

Figure 6.11. The partial sums $S_n := \sum_{j=0}^n a_j$ and the error $R_n := \pi - S_n$ for
$a_j = \frac{4(-1)^j}{2j+1} \big( \frac{4}{5^{2j+1}} - \frac{1}{239^{2j+1}} \big)$.

            n      S_n                  R_n
            1      3.140597029326061     1E-03
            2      3.141621029325035    -3E-05
            3      3.141591772182178     9E-07
            5      3.141592652615309     1E-09
            7      3.141592653588603     1E-12
            9      3.141592653589793     4E-16
        $\pi$      3.141592653589793
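The three arctangent representations of $\pi$ discussed in 6.17 can be compared side by side. The following Python sketch uses double-precision floats, so errors below roughly $10^{-16}$ are not meaningful; the function names are illustrative only.

```python
import math

def arctan_partial(x, n):
    """Partial sum sum_{k=0}^n (-1)^k x^(2k+1) / (2k+1) of the arctangent series."""
    return sum((-1) ** k * x ** (2 * k + 1) / (2 * k + 1) for k in range(n + 1))

def pi_leibniz(n):
    """4 * arctan(1), truncated after the term of index n."""
    return 4 * arctan_partial(1.0, n)

def pi_sqrt3(n):
    """6 * arctan(1/sqrt(3)), truncated after the term of index n."""
    return 6 * arctan_partial(1 / math.sqrt(3), n)

def pi_machin(n):
    """4 * (4 arctan(1/5) - arctan(1/239)), truncated after the term of index n."""
    return 4 * (4 * arctan_partial(1 / 5, n) - arctan_partial(1 / 239, n))

# One row per n: absolute errors of the three truncations.
for n in (1, 3, 5, 9):
    errors = [abs(math.pi - f(n)) for f in (pi_leibniz, pi_sqrt3, pi_machin)]
    print(n, " ".join(f"{e:.1e}" for e in errors))
```

The first column decays like $1/n$, the other two geometrically, in line with the remainder estimate $|x|^{2n+3}/(2n+3)$ evaluated at $x = 1$, $1/\sqrt{3}$ and $1/5$.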


Figure 6.12. The partial sums $S_n$ and the errors $R_n := \log 2 - S_n$ for, on
the left, $S_n := \sum_{j=1}^n (-1)^{j+1}/j$, and on the right, $S_n := \sum_{j=0}^n 2/(3^{2j+1}(2j+1))$.

    (left)  n      S_n                  R_n
            1      1.000000000000000    -3E-01
            3      0.833333333333333    -1E-01
           10      0.645634920634921     5E-02
           30      0.676758137691398     2E-02
          100      0.688172179310195     5E-03
          300      0.691483291655625     2E-03
         1000      0.692647430559822     5E-04
         3000      0.692980541671060     2E-04
        10000      0.693097183059958     5E-05
        log 2      0.693147180559945

    (right) n      S_n                  R_n
            1      0.691358024691358     2E-03
            2      0.693004115226337     1E-04
            3      0.693134757332288     1E-05
            5      0.693147073759785     1E-07
            7      0.693147179548241     1E-09
            9      0.693147180549812     1E-11
           11      0.693147180559840     1E-13
           13      0.693147180559944     1E-15
           15      0.693147180559945     2E-16
        log 2      0.693147180559945

6.18 Approximations of log 2. Similarly, one can find a series which converges to
$\log 2$. In fact, the Taylor series for $\log(1+x)$, (6.7) and (6.6), yield in particular

$$\log 2 = 1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \cdots \tag{6.16}$$

and the estimate of the rate of decay for the remainder

$$\Big| \log 2 - \sum_{k=1}^{n} \frac{(-1)^{k+1}}{k} \Big| = |R_{n-1}(1)| \le \frac{1}{n+1}.$$

This suggests that the remainder decays slowly to zero, as in fact is the case, see Figure 6.12 (a).
A better approximation of $\log 2$ is obtained by observing that, for $0 \le x < 1$,

$$\log\Big(\frac{1+x}{1-x}\Big) = \log(1+x) - \log(1-x) = \sum_{n=0}^{\infty} \big((-1)^n + 1\big) \frac{x^{n+1}}{n+1} = 2 \sum_{p=0}^{\infty} \frac{x^{2p+1}}{2p+1},$$

hence

$$\log 2 = \log \frac{1 + 1/3}{1 - 1/3} = 2 \sum_{n=0}^{\infty} \frac{1}{(2n+1)\,3^{2n+1}};$$

in this case the error decays exponentially, see Figure 6.12 (b), as

$$2 \sum_{k=n+1}^{\infty} \frac{1}{(2k+1)\,3^{2k+1}} \le \frac{2}{3(2n+1)} \sum_{k=n+1}^{\infty} \frac{1}{9^k} = \frac{1}{12\,(2n+1)\,9^n}.$$
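The contrast between the two series for $\log 2$ is easy to reproduce. In this hedged sketch, the slow series is the alternating harmonic series with $n$ terms and the fast one is the series in powers of $1/3$ truncated at index $n$; the function names and the chosen values of $n$ are assumptions made for illustration.

```python
import math

def log2_alternating(n):
    """Partial sum 1 - 1/2 + 1/3 - ... with n terms (slow convergence)."""
    return sum((-1) ** (k + 1) / k for k in range(1, n + 1))

def log2_fast(n):
    """Partial sum 2 * sum_{k=0}^n 1 / ((2k+1) 3^(2k+1)) (geometric convergence)."""
    return 2 * sum(1 / ((2 * k + 1) * 3 ** (2 * k + 1)) for k in range(n + 1))

for n in (1, 3, 10):
    slow = math.log(2) - log2_alternating(n)
    fast = math.log(2) - log2_fast(n)
    # Bounds: 1/(n+1) for the slow series,
    # 1/(12 (2n+1) 9^n) for the fast one (plus a tiny rounding slack).
    assert abs(slow) <= 1 / (n + 1)
    assert 0 <= fast <= 1 / (12 * (2 * n + 1) * 9 ** n) + 1e-16
    print(n, f"{slow:+.1e}", f"{fast:.1e}")
```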



b. More on the number e 

From (6.11) and (6.10) we infer

$$e = \sum_{j=0}^{\infty} \frac{1}{j!} \tag{6.17}$$

with

$$0 < e - \sum_{j=0}^{n} \frac{1}{j!} = \sum_{k=n+1}^{\infty} \frac{1}{k!} \le \frac{e}{(n+1)!} < \frac{4}{(n+1)!}, \tag{6.18}$$

since $2 < e < 4$. For instance, for $n = 6$, $\sum_{j=0}^{6} \frac{1}{j!} = 2.718055\dots$ approximates
$e$ from below with an error not higher than $4/7! = 1/1260$, which
yields

$$2.718055\dots < e < 2.718849\dots.$$

Figure 6.13. The partial sums $S_n := \sum_{j=0}^{n-1} 1/j!$ and the error $e - S_n$.

            n      S_n                  e - S_n
            1      1.000000000000000     2E+00
            3      2.500000000000000     2E-01
            5      2.708333333333333     1E-02
            7      2.718055555555555     2E-04
            9      2.718278769841270     3E-06
           11      2.718281801146385     3E-08
           13      2.718281828286169     2E-10
           15      2.718281828458230     8E-13
           17      2.718281828459043     2E-15
            e      2.718281828459045
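The two-sided estimate for $e$ with $n = 6$ can be reproduced in exact arithmetic. The sketch below also evaluates the sharper bound $1/(n\,n!)$ that appears in Exercise 6.20; the variable names are illustrative.

```python
from fractions import Fraction
from math import factorial

def e_partial(n):
    """Exact partial sum sum_{j=0}^n 1/j! as a Fraction."""
    return sum(Fraction(1, factorial(j)) for j in range(n + 1))

n = 6
lower = e_partial(n)
upper_coarse = lower + Fraction(4, factorial(n + 1))   # from the bound 4/(n+1)!
upper_sharp = lower + Fraction(1, n * factorial(n))    # from the bound 1/(n n!)

print(float(lower), float(upper_coarse), float(upper_sharp))
```

Both upper bounds bracket $e = 2.71828\dots$ from above, the sharper one already agreeing with $e$ to five decimal places.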



The estimate (6.18) also implies the following. 



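6.19 Theorem. $e$ is irrational.

Proof. In fact, suppose on the contrary $e$ rational, $e = p/q$, $p, q \in \mathbb{Z}$, $q > 0$. From (6.18)

$$0 < \frac{p}{q} - \sum_{j=0}^{n} \frac{1}{j!} < \frac{4}{(n+1)!}.$$

Multiplying by $n!$ we then get

$$0 < \frac{p}{q}\,n! - n! \sum_{j=0}^{n} \frac{1}{j!} < \frac{4}{n+1},$$

that is, $\frac{p}{q}\,n!$ and $n! \sum_{j=0}^n 1/j!$ being integers for $n \ge q$, and an integer smaller than 1 in absolute value being zero,

$$\frac{p}{q}\,n! - n! \sum_{j=0}^{n} \frac{1}{j!} = 0 \quad \forall n \ge \max(3, q),$$

a contradiction with the strict positivity above. □

6.20 ¶. One can estimate the error better. Show that

$$0 < e - \sum_{j=0}^{n} \frac{1}{j!} < \frac{1}{n\,n!},$$

which yields, for $n = 6$, $2.718055\dots < e < 2.718287\dots$. [Hint: Write

$$e - \sum_{j=0}^{n} \frac{1}{j!} = \sum_{j=n+1}^{\infty} \frac{1}{j!} = \sum_{j=0}^{\infty} \frac{1}{(n+1+j)!}$$

and observe

$$(n+1+j)! = (n+1)!\,(n+2)(n+3) \cdots (n+j+1) \ge (n+1)!\,(n+2)^j.]$$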



6.3 Series of Nonnegative Terms 

The problem of studying the convergence of a series (of real terms) simpli- 
fies a great deal if we restrict our attention to series of nonnegative terms. 
In fact, in this case the sequence of partial sums is increasing, therefore it
has a limit that can be finite or infinite. In particular we can pass to the
limit in equalities and inequalities involving the partial sums and we can
estimate the sum of the series.

For example, if $a_j \le b_j$ for all $j \ge p$, then

$$\sum_{j=p}^{n} a_j \le \sum_{j=p}^{n} b_j \quad \forall n \ge p; \tag{6.19}$$

however we cannot take the limit as $n \to \infty$ until we know the existence of
the limits. In the case of series of positive terms (or definitively¹
nonnegative, or even definitively of constant sign) the respective sequences
of partial sums are (definitively) monotone, hence the existence of the sum
is granted. We therefore can state

6.21 Proposition (Comparison test). Let $\sum_{j=0}^\infty a_j$ and $\sum_{j=0}^\infty b_j$ be
two series of positive terms. Suppose that $a_j \le b_j$ for all $j \ge p$. Then

(i) if $\sum_{j=0}^\infty b_j$ converges, then $\sum_{j=0}^\infty a_j$ converges,

(ii) if $\sum_{j=0}^\infty a_j$ diverges, then $\sum_{j=0}^\infty b_j$ diverges.

In both cases

$$\sum_{j=p}^{\infty} a_j \le \sum_{j=p}^{\infty} b_j. \tag{6.20}$$

¹ We shall say that a predicate $p(n)$ holds definitively if there exists $\bar{n}$ such that $p(n)$
holds true for all $n \ge \bar{n}$. Notice that "definitively" is much more than "for infinitely
many indices."

Figure 6.14. The dashed area $H_n$ and the graph of $1/x$, $x > 0$.

6.22 Example. Since

$$\frac{1}{j^2} \le \frac{2}{j(j+1)} \quad \forall j \ge 1,$$

we infer

$$\sum_{j=1}^{n} \frac{1}{j^2} \le 2 \sum_{j=1}^{n} \frac{1}{j(j+1)},$$

hence

$$\sum_{j=1}^{\infty} \frac{1}{j^2} \le 2 \sum_{j=1}^{\infty} \frac{1}{j(j+1)} = 2.$$

Actually, compare 7.79, we have $\sum_{j=1}^\infty 1/j^2 = \pi^2/6$.

6.23 Example. Let us show that the series $\sum_{n=1}^\infty \log\cos(1/n)$ is convergent.
Observe that all terms of the series are negative, and that $\cos 1 \le \cos(1/n) < 1$ $\forall n$.
The estimates (see, e.g., Section 5.4 of [GM1])

$$\log x \ge K(x - 1), \quad \cos 1 \le x \le 1, \quad K := -\frac{\log(\cos 1)}{1 - \cos 1},$$

$$\cos x \ge 1 - \frac{x^2}{2}, \quad x \in \mathbb{R},$$

yield then

$$-\log\cos\frac{1}{n} \le K\Big(1 - \cos\frac{1}{n}\Big) \le \frac{K}{2n^2},$$

hence

$$-\sum_{n=1}^{\infty} \log\cos\frac{1}{n} \le \frac{K}{2} \sum_{n=1}^{\infty} \frac{1}{n^2} < +\infty$$

by the comparison test and Example 6.22.

A variant of Proposition 6.21 is

6.24 Proposition (Asymptotic comparison test). Let $\sum_{j=0}^\infty a_j$ and
$\sum_{j=0}^\infty b_j$ be two series of positive terms. Suppose that

$$\frac{a_n}{b_n} \to L \in \mathbb{R}.$$

(i) If $\sum_{j=0}^\infty a_j$ diverges, then $\sum_{j=0}^\infty b_j$ diverges.

(ii) If $\sum_{j=0}^\infty b_j$ converges, then $\sum_{j=0}^\infty a_j$ converges.

Proof. In fact the sequence $a_n/b_n$ is bounded, i.e., $\exists M > 0$ such that
$a_n \le M b_n$ for all $n$. The comparison test in Proposition 6.21 then yields

$$\sum_{j=0}^{\infty} a_j \le M \sum_{j=0}^{\infty} b_j. \qquad □$$



a. Series of positive decreasing terms

6.25 Example (Harmonic series, I). An especially relevant series is

$$\sum_{j=1}^{\infty} \frac{1}{j},$$

called the harmonic series, since the numbers $1, 1/2, 1/3, \dots, 1/n$ represent the ratio
of the lengths of "harmonic" vibrating strings.

There is no closed formula for the partial sums $H_n := \sum_{j=1}^n \frac{1}{j}$. However, it is easily
seen that the harmonic series diverges, $H_n \to \infty$; moreover its partial sums $H_n$ can be
estimated by considering the improper integral associated to it.

Let $\varphi : [0, +\infty[ \to \mathbb{R}$ be the piecewise constant function defined by

$$\varphi(x) = \frac{1}{j} \quad \text{if } j - 1 \le x < j.$$

As we have seen, and it is evident (see Figure 6.14),

$$H_n = \int_0^n \varphi(x)\,dx.$$

On the other hand,

$$\frac{1}{x+1} \le \varphi(x) \le \frac{1}{x} \quad \forall x > 0,$$

hence

$$\int_0^n \frac{1}{1+x}\,dx \le H_n \le 1 + \int_1^n \frac{1}{x}\,dx,$$

that is,

$$\log(1+n) \le H_n \le 1 + \log n. \tag{6.21}$$

$H_n$ is therefore asymptotic to $\log n$, $H_n/\log n \to 1$. In particular $H_n$ tends to
infinity quite slowly as $n \to \infty$, see Figure 6.15.



6.26 Example (Euler-Mascheroni constant). Let us now consider the difference
$\gamma_n := H_n - \log n$. From (6.21) we see that $0 < \gamma_n \le 1$. Since

$$\gamma_n = 1 + \int_1^n \Big(\varphi(x) - \frac{1}{x}\Big)\,dx \quad \text{and} \quad \varphi(x) \le \frac{1}{x} \ \ \forall x,$$

the sequence $\{\gamma_n\}$ is decreasing and has limit $\gamma$. Since $1/x$ is convex, we also deduce
for $x \in [j-1, j]$

$$\frac{1}{x} \le \frac{1}{j} + \Big(\frac{1}{j-1} - \frac{1}{j}\Big)(j - x),$$

hence

$$\frac{1}{x} - \varphi(x) \le \Big(\frac{1}{j-1} - \frac{1}{j}\Big)(j - x)$$

and, integrating on $[1, n]$,

$$\gamma_n = 1 + \int_1^n \Big(\varphi(x) - \frac{1}{x}\Big)\,dx \ge 1 - \frac{1}{2} \sum_{j=2}^{n} \Big(\frac{1}{j-1} - \frac{1}{j}\Big) = \frac{1}{2} + \frac{1}{2n}.$$

In conclusion we can state

Proposition. The partial sums $H_n$ of the harmonic series are asymptotic to $\log n$.
Moreover $\{H_n - \log n\}$ is a decreasing sequence with limit $\gamma \in \,]1/2, 1[$.

The previous constant $\gamma$ is called the Euler-Mascheroni constant. It has an approximate
value of $0.57721566\dots$, but it is not known if it is irrational or rational.

From

$$H_n = \log n + \gamma + o(1)$$

we see

$$\frac{H_n}{\log n} - 1 = \frac{\gamma + o(1)}{\log n}.$$

This explains the slowness of the convergence $H_n/\log n \to 1$ that one sees in Figure 6.15.

Figure 6.15. $H_n = \sum_{j=1}^n 1/j$, $H_n/\log n - 1$ and $H_n - \log n$.

            n      H_n                   H_n/log n - 1    H_n - log n
           10      2.928968253968254     3E-01            0.626383160974208
           30      3.994987130920391     2E-01            0.593789749258235
          100      5.187377517639621     1E-01            0.582207331651529
          300      6.282663880299502     1E-01            0.578881405643301
         1000      7.485470860550343     8E-02            0.577715581568206
         3000      8.583749889959170     7E-02            0.577382322308925
        10000      9.787606036044345     6E-02            0.577265664068161
        30000     10.886184992119919     6E-02            0.577232331475626
       100000     12.090146129863282     5E-02            0.577220664893053
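The estimates for the harmonic numbers can be checked numerically: the two-sided bound $\log(1+n) \le H_n \le 1 + \log n$, the monotonicity of $H_n - \log n$, and the lower bound $1/2$. This is a small floating-point sketch with an illustrative helper `harmonic`; the sampled values of $n$ are arbitrary.

```python
import math

def harmonic(n):
    """H_n = 1 + 1/2 + ... + 1/n."""
    return sum(1.0 / j for j in range(1, n + 1))

previous_gamma = None
for n in (10, 100, 1000, 10000):
    h = harmonic(n)
    # Two-sided estimate log(1 + n) <= H_n <= 1 + log n.
    assert math.log(1 + n) <= h <= 1 + math.log(n)
    gamma_n = h - math.log(n)
    # gamma_n decreases with n and stays above 1/2.
    assert gamma_n > 0.5
    assert previous_gamma is None or gamma_n < previous_gamma
    previous_gamma = gamma_n
    print(n, round(h, 6), round(h / math.log(n) - 1, 3), round(gamma_n, 6))
```

The third printed column shrinks only like $\gamma/\log n$, while the last column settles quickly near $0.5772$.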



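6.27 Example (The harmonic series, II). One can prove that the harmonic series
diverges also as follows. Observe that we have

$$\begin{aligned} 1 &= 1 \le 1, \\ \frac{1}{2} &= \frac{2}{4} \le \frac{1}{2} + \frac{1}{3}, \\ \frac{1}{2} &= \frac{4}{8} \le \frac{1}{4} + \frac{1}{5} + \frac{1}{6} + \frac{1}{7}, \\ \frac{1}{2} &= \frac{8}{16} \le \frac{1}{8} + \frac{1}{9} + \cdots + \frac{1}{15}, \\ &\ \ \vdots \\ \frac{1}{2} &= \frac{2^n}{2^{n+1}} \le \sum_{k=2^n}^{2^{n+1}-1} \frac{1}{k}, \end{aligned}$$

thus

$$1 + \frac{n}{2} \le H_{2^{n+1}-1} \quad \text{for all } n,$$

in particular $H_{2^n-1} \to \infty$. Since $\{H_{2^n-1}\}$ is a subsequence of $\{H_n\}$ and $H_n$ is increasing,
we conclude $H_n \to \infty$.

6.28 Example (The generalized harmonic series, I). Consider the series $\sum_{n=1}^\infty \frac{1}{n^\alpha}$,
$\alpha > 0$, $\alpha \ne 1$, and the piecewise constant function $\varphi : [0, +\infty[ \to \mathbb{R}$ defined by

$$\varphi(x) = \frac{1}{n^\alpha} \quad \text{if } n - 1 \le x < n.$$

For $n \ge k \ge 1$ we have

$$\sum_{j=k}^{n} \frac{1}{j^\alpha} = \int_{k-1}^{n} \varphi(x)\,dx.$$

Since $1/(x+1)^\alpha \le \varphi(x) \le 1/x^\alpha$ for all $x > 0$, and therefore

$$\int_0^n \frac{1}{(x+1)^\alpha}\,dx \le \sum_{j=1}^{n} \frac{1}{j^\alpha} \le 1 + \int_1^n \frac{1}{x^\alpha}\,dx,$$

we conclude

$$\frac{(n+1)^{-\alpha+1} - 1}{-\alpha + 1} \le \sum_{j=1}^{n} \frac{1}{j^\alpha} \le 1 + \frac{n^{-\alpha+1} - 1}{-\alpha + 1}.$$

Therefore we can state

Proposition. The generalized harmonic series $\sum_{n=1}^\infty \frac{1}{n^\alpha}$ converges if and only if
$\alpha > 1$ and

$$\frac{1}{\alpha - 1} \le \sum_{n=1}^{\infty} \frac{1}{n^\alpha} \le 1 + \frac{1}{\alpha - 1}. \tag{6.22}$$

The reasoning in Example 6.27 extends to obtain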



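6.29 Theorem (Cauchy condensation test). Let $\sum_{j=1}^\infty a_j$ be a series
of nonnegative and decreasing terms. Then $\sum_{j=1}^\infty a_j$ converges if and only
if $\sum_{j=0}^\infty 2^j a_{2^j}$ converges. Moreover, the following estimates hold:

$$\frac{1}{2} \sum_{j=1}^{\infty} 2^j a_{2^j} \le \sum_{j=1}^{\infty} a_j \le \sum_{j=0}^{\infty} 2^j a_{2^j}. \tag{6.23}$$

Proof. By the assumptions made we have

$$\begin{aligned} \tfrac{1}{2}\, 2 a_2 &\le a_1 \le a_1, \\ \tfrac{1}{2}\, 4 a_4 &\le a_2 + a_3 \le 2 a_2, \\ \tfrac{1}{2}\, 8 a_8 &\le a_4 + a_5 + a_6 + a_7 \le 4 a_4, \\ \tfrac{1}{2}\, 16 a_{16} &\le a_8 + a_9 + \cdots + a_{15} \le 8 a_8, \\ &\ \ \vdots \\ \tfrac{1}{2}\, 2^{n+1} a_{2^{n+1}} &\le \sum_{j=2^n}^{2^{n+1}-1} a_j \le 2^n a_{2^n}. \end{aligned}$$

Summing, we infer

$$\frac{1}{2} \sum_{j=1}^{n+1} 2^j a_{2^j} \le \sum_{j=1}^{2^{n+1}-1} a_j \le \sum_{j=0}^{n} 2^j a_{2^j} \tag{6.24}$$

for all $n \ge 0$. We then conclude

$$\sum_{j=1}^{n} a_j \le \sum_{j=1}^{2^{n+1}-1} a_j \le \sum_{j=0}^{\infty} 2^j a_{2^j} \qquad \text{and} \qquad \frac{1}{2} \sum_{j=1}^{n+1} 2^j a_{2^j} \le \sum_{j=1}^{\infty} a_j.$$

On the other hand $\big\{\sum_{j=1}^{2^{n+1}-1} a_j\big\}$ is a subsequence of $\big\{\sum_{j=1}^{n} a_j\big\}$, hence

$$\lim_{n \to \infty} \sum_{j=1}^{2^{n+1}-1} a_j = \sum_{j=1}^{\infty} a_j.$$

Passing to the limit in (6.24) we get (6.23), hence the result. □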



6.30 Example (The generalized harmonic series, II). The Cauchy condensation
test yields

Proposition. The generalized harmonic series $\sum_{n=1}^\infty \frac{1}{n^\alpha}$ converges if and only if $\alpha > 1$.

Proof. For $\alpha < 0$ the terms do not tend to zero. For $\alpha \ge 0$ the assumptions of the Cauchy condensation test are satisfied,
therefore $\sum_{n=1}^\infty \frac{1}{n^\alpha}$ converges if and only if the geometric series

$$\sum_{j=0}^{\infty} 2^j \frac{1}{2^{\alpha j}} = \sum_{j=0}^{\infty} \Big(\frac{1}{2^{\alpha - 1}}\Big)^j$$

converges. The last converges if and only if $1/2^{\alpha-1} < 1$, that is, $\alpha > 1$. □
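The finite form of the condensation estimate, namely $\frac12 \sum_{j=1}^{n+1} 2^j a_{2^j} \le \sum_{j=1}^{2^{n+1}-1} a_j \le \sum_{j=0}^{n} 2^j a_{2^j}$, can be observed numerically for $a_j = j^{-\alpha}$. The helper `condensed_bounds` below is a hypothetical name introduced for this sketch.

```python
def condensed_bounds(a, n):
    """Return the three quantities compared in the condensation estimate.

    a is a function j -> a_j with nonnegative, decreasing values for j >= 1.
    Returns (lower, middle, upper) where
      lower  = (1/2) * sum_{j=1}^{n+1} 2^j a(2^j),
      middle = sum_{j=1}^{2^(n+1) - 1} a(j),
      upper  = sum_{j=0}^{n} 2^j a(2^j).
    """
    lower = 0.5 * sum(2 ** j * a(2 ** j) for j in range(1, n + 2))
    middle = sum(a(j) for j in range(1, 2 ** (n + 1)))
    upper = sum(2 ** j * a(2 ** j) for j in range(0, n + 1))
    return lower, middle, upper

for alpha in (1.0, 2.0):
    for n in (3, 8, 12):
        lower, middle, upper = condensed_bounds(lambda j: j ** -alpha, n)
        assert lower <= middle <= upper
        print(alpha, n, round(lower, 4), round(middle, 4), round(upper, 4))
```

For $\alpha = 1$ both bounds grow linearly in $n$, so the harmonic series diverges; for $\alpha = 2$ they stay bounded, so the series converges.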



b. The root and ratio tests 

Some comparisons are more frequent than others. They lead to rules, called
convergence tests. Here we present two of them: Cauchy's root test and
d'Alembert's ratio test.

6.31 Theorem (Root test). Let $\sum_{n=0}^\infty a_n$ be a series of nonnegative
terms. Suppose there is a positive constant $K < 1$ and a natural $p \in \mathbb{N}$
such that for all $n \ge p$,

$$\sqrt[n]{a_n} \le K < 1.$$

Then $\sum_{n=0}^\infty a_n$ converges and

$$\sum_{n=p}^{\infty} a_n \le \frac{K^p}{1-K}.$$

If there is a positive constant $K > 1$ and a natural $p \in \mathbb{N}$ such that for all
$n \ge p$, $\sqrt[n]{a_n} \ge K > 1$, then $\sum_{n=0}^\infty a_n$ diverges.

Proof. In fact, for $j \ge p$ we have $0 \le a_j \le K^j$, hence

$$\sum_{j=p}^{n} a_j \le \sum_{j=p}^{n} K^j = K^p \sum_{j=0}^{n-p} K^j \le \frac{K^p}{1-K}.$$

Passing to the limit as $n \to \infty$, the claim follows.
Similarly one proves divergence if $\sqrt[n]{a_n} \ge K > 1$. □

6.32 Proposition (Ratio test). Let $\sum_{n=1}^\infty a_n$ be a series of positive
terms. Suppose there is a positive constant $K < 1$ and a natural $p \in \mathbb{N}$
such that for all $n \ge p$,

$$\frac{a_{n+1}}{a_n} \le K < 1.$$

Then $\sum_{n=1}^\infty a_n$ converges and

$$\sum_{n=p}^{\infty} a_n \le \frac{a_p}{1-K}.$$

Suppose there is a positive constant $K > 1$ and a natural $p \in \mathbb{N}$ such that
$\frac{a_{n+1}}{a_n} \ge K > 1$ for all $n \ge p$. Then $\sum_{n=1}^\infty a_n$ diverges.

Proof. Inductively we find

$$a_{p+1} \le K a_p, \qquad a_{p+2} \le K a_{p+1} \le K^2 a_p, \qquad \dots$$

$$a_{p+j} \le K a_{p+j-1} \le K^2 a_{p+j-2} \le \cdots \le K^j a_p,$$

hence, summing from $p$ to $n$,

$$\sum_{j=p}^{n} a_j \le a_p \sum_{j=0}^{n-p} K^j \le \frac{a_p}{1-K}.$$

When $n \to \infty$, we get the result.
Similarly one proves divergence if $a_{n+1}/a_n \ge K > 1$. □

6.33 Remark. Notice that root and ratio tests are inconclusive if $\sqrt[n]{a_n} \to 1$
or $a_{n+1}/a_n \to 1$, as is shown by the generalized harmonic series. Also,
because of Example 2.57, whenever the ratio test yields convergence, the
root test does, too; whenever the root test is inconclusive, the ratio test is
inconclusive, too.

Figure 6.16. François Viète (1540-1603)
and the frontispiece of his Opera Mathematica.
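As an illustration of the ratio test of Proposition 6.32 and of its tail bound $\sum_{n \ge p} a_n \le a_p/(1-K)$, consider the sample series $a_n = n/2^n$, which is not taken from the text. Here $a_{n+1}/a_n = (n+1)/(2n) \le 3/4$ for $n \ge 2$, so one may take $p = 2$ and $K = 3/4$.

```python
from fractions import Fraction

def a(n):
    """Term a_n = n / 2^n of the sample series."""
    return Fraction(n, 2 ** n)

p, K = 2, Fraction(3, 4)

# Hypothesis of the ratio test: a_{n+1} / a_n <= K < 1 for all n >= p.
# Here a_{n+1} / a_n = (n + 1) / (2 n), which is decreasing in n.
assert all(a(n + 1) / a(n) <= K for n in range(p, 200))

# Conclusion: every partial sum from p on is bounded by a_p / (1 - K).
tail_bound = a(p) / (1 - K)
partial_tail = sum(a(n) for n in range(p, 60))
assert partial_tail <= tail_bound
print(float(partial_tail), float(tail_bound))
```

The exact value of the tail is $3/2$, so the bound $2$ given by the test is of the right order but not sharp.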



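c. Viète's formula for π

From $\sin x = 2\sin(x/2)\cos(x/2)$, we infer by induction

$$\sin x = 2^n \sin\Big(\frac{x}{2^n}\Big) \prod_{k=1}^{n} \cos\Big(\frac{x}{2^k}\Big)$$

and, since $2^n \sin(x/2^n) \to x$, we find

$$\sin x = x \prod_{k=1}^{\infty} \cos\Big(\frac{x}{2^k}\Big). \tag{6.25}$$

On the other hand $\cos^2(x/2) = (1 + \cos x)/2$, thus $\cos(x/2) = \sqrt{\frac{1}{2} + \frac{1}{2}\cos x}$ for $x \in
[0, \pi/2]$, hence

$$\cos\frac{\pi}{4} = \sqrt{\frac{1}{2}}, \qquad \cos\frac{\pi}{8} = \sqrt{\frac{1}{2} + \frac{1}{2}\sqrt{\frac{1}{2}}}, \qquad \cos\frac{\pi}{16} = \sqrt{\frac{1}{2} + \frac{1}{2}\sqrt{\frac{1}{2} + \frac{1}{2}\sqrt{\frac{1}{2}}}}, \quad \dots$$

Therefore (6.25) with $x = \pi/2$ yields the Viète formula

$$\frac{2}{\pi} = \sqrt{\frac{1}{2}} \, \sqrt{\frac{1}{2} + \frac{1}{2}\sqrt{\frac{1}{2}}} \, \sqrt{\frac{1}{2} + \frac{1}{2}\sqrt{\frac{1}{2} + \frac{1}{2}\sqrt{\frac{1}{2}}}} \cdots. \tag{6.26}$$

Notice that the Viète sequence $\{x_n\}$,

$$x_n := \prod_{k=1}^{n} \cos\Big(\frac{\pi}{2^{k+1}}\Big) = \frac{1}{2^n \sin\big(\frac{\pi}{2^{n+1}}\big)},$$

converges exponentially fast to $2/\pi$; in fact we have

$$0 < x_n - \frac{2}{\pi} \le C \frac{1}{4^n}. \tag{6.27}$$

6.34 ¶. Show that $\prod_{k=1}^\infty \cos(x/2^k)$ converges.

6.35 ¶. Prove (6.27). [Hint: Use that $\sin x \ge x - x^3/3!$ holds for $x \ge 0$.]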
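The Viète sequence can be generated with nothing more than square roots, by applying the half-angle formula repeatedly starting from $\cos(\pi/2) = 0$. The sketch below compares it with the closed form $1/(2^n \sin(\pi/2^{n+1}))$; the function name `viete_product` is illustrative.

```python
import math

def viete_product(n):
    """x_n = prod_{k=1}^n cos(pi / 2^(k+1)), built from nested square roots."""
    product = 1.0
    c = 0.0                            # cos(pi/2)
    for _ in range(n):
        c = math.sqrt(0.5 + 0.5 * c)   # half-angle formula: cos(t/2) from cos(t)
        product *= c
    return product

for n in (1, 2, 5, 10):
    x_n = viete_product(n)
    closed = 1.0 / (2 ** n * math.sin(math.pi / 2 ** (n + 1)))
    assert abs(x_n - closed) < 1e-12
    # Print the approximation 2 / x_n of pi and the gap x_n - 2/pi.
    print(n, 2 / x_n, f"{x_n - 2 / math.pi:.1e}")
```

Each additional factor reduces the gap $x_n - 2/\pi$ by about a factor of 4, consistent with the exponential estimate.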



d. Euler and Wallis formulas 

Prom de Moivre’s formula 

(cos t -\-i sin t)^ = cos sin fct, t eR, k E Z, 

we see that ^ 

sin kt = sint^fccos^“^ t — cos^~^ tsin^ t H ^ 

By observing that cos^’^ t = (1 — sin^ we readily conclude that sin fct, for A; = 2n + 1 
odd, is a polynomial in sint of degree k. Since sin((2n + l)t) has 2n + 1 distinct zeros 

^ ~ • • • 5 ? ^5 

n 

sin(2n + l)t = C (sin t — sin t j ) = C sin t (sin t — sin t j ) . 

j= — n 

Dividing by C and passing to the limit for t — > 0, 

C • sin tj = 2n + 1 



j = — n,...,n 

j¥=o 



j = — n,...,n 
3^0 



and we get 



sin(2n + l)t = (2n + 1) sint TT (^1 — = (2n + 1) sint V 

' sin to ' . - ' sin tn ^ 

j = — n,...,n J J = 1 ^ 

Finally, replacing ( 2 n + l)t by a: we deduce 

/o ^ ^ TT /^i sin2(x/(2n + 1)) \ 

\x = ( 2 n + 1) sm ( ) 1 I (1 5- ^ ) . 

V2n + l/“\ sm^(^7r/(2n + 1)) / 



sma 



When n ^ 00 , a “naive” passage to the limit yields 

^2 



sinx = cc J~J ^ ^ ^ ^ ^ 



j=i 



that, for X = tt/ 2, yields, in turn, Wallises formula for tt (see Example 2.66), 
2 2n — 1 2n + 1 tt 2n 2n 

n=l 



00 pj 

7 T 1 — r Zn 

or — = M 

2 2n - 1 



7T n 2n 2n ' 2 2n - 1 2n + 1 



(6.28) 



(6.29) 



n=l 



Actually, Euler’s formula for sin is equivalent, for |x| < tt, to 






J = 1 

Since differentiation term by term is allowed in the series on the right, it turns out to 
be equivalent also to 



1 o ^ 1 

COtX = 2x > r 

'T * 



JirJ J 



(6.30) 
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known as Euler’s formula for cotangent. 

The natural context of Euler’s formulas for sin and cot is the theory of complex 
functions. There they arise in a transparent and simple way. As Jacques Hadamard 
(1865—1963) put it: Le plus court chemin entre deux enonces reels passe par le com- 
plexes. Here we prove Euler’s formula for |x| < 1. 

First we state the following theorem that is interesting by itself. 



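6.36 Theorem (of dominated convergence). Suppose that the double sequence of
numbers $a_{j,n}$ is such that

(i) $a_{j,n} \to a_j$ as $n \to \infty$ for all $j$,

(ii) for all $n$ we have $|a_{j,n}| \le c_j$ with $\sum_{j} c_j < \infty$.

Then $\sum_{j} a_j$ converges and

$$\sum_{j=1}^{\infty} a_{j,n} \to \sum_{j=1}^{\infty} a_j \qquad \text{when } n \to \infty.$$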

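Proof. First observe that, since $a_{j,n} \to a_j$ and $|a_{j,n}| \le c_j$ $\forall n, j$, we also have $|a_j| \le c_j$
$\forall j$, hence $\sum_j a_j$ converges absolutely.

Fix $\epsilon > 0$ and choose $p = p(\epsilon)$ such that $2 \sum_{j=p+1}^{\infty} c_j < \epsilon$. Then

$$\Bigl| \sum_{j=0}^{\infty} a_{j,n} - \sum_{j=0}^{\infty} a_j \Bigr| \le \sum_{j=0}^{\infty} |a_{j,n} - a_j| = \sum_{j=0}^{p} |a_{j,n} - a_j| + \sum_{j=p+1}^{\infty} |a_{j,n} - a_j|$$

$$\le \sum_{j=0}^{p} |a_{j,n} - a_j| + 2 \sum_{j=p+1}^{\infty} c_j < \sum_{j=0}^{p} |a_{j,n} - a_j| + \epsilon,$$

hence

$$\limsup_{n \to \infty} \Bigl| \sum_{j=0}^{\infty} a_{j,n} - \sum_{j=0}^{\infty} a_j \Bigr| \le \epsilon,$$

and finally the claim, $\epsilon$ being arbitrary.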

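Proof of (6.28). Set for $|x| < 2$,

$$a_{j,n} := \begin{cases} \log\Bigl(1 - \dfrac{\sin^2(x/(2n+1))}{\sin^2(j\pi/(2n+1))}\Bigr) & \text{if } j \le n, \\ 0 & \text{if } j > n, \end{cases}$$

and

$$a_j := \log\Bigl(1 - \frac{x^2}{j^2\pi^2}\Bigr).$$

Clearly $a_{j,n} \to a_j$ as $n \to \infty$. As $\sin t \ge \frac{2}{\pi} t$ $\forall t \in [0, \pi/2]$, we have

$$\frac{\sin^2(x/(2n+1))}{\sin^2(j\pi/(2n+1))} \le \frac{x^2}{4j^2} \qquad \forall j \le n;$$

consequently

$$|a_{j,n}| \le -\log\Bigl(1 - \frac{x^2}{4j^2}\Bigr) =: c_j$$

and

$$\sum_{j=1}^{\infty} c_j < \infty.$$

Applying the theorem of dominated convergence, we conclude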



Footnote: the shortest path between two real statements is via the complex domain.

  $n$                  1      2      3      4      5      6      7      8
  $a_n = (-1)^n/n$    -1     1/2   -1/3    1/4   -1/5    1/6   -1/7    1/8
  $a_n^+$              0     1/2     0     1/4     0     1/6     0     1/8
  $a_n^-$              1      0     1/3     0     1/5     0     1/7     0
  $|a_n|$              1     1/2    1/3    1/4    1/5    1/6    1/7    1/8

Figure 6.17. $a_n$, $a_n^+$, $a_n^-$ and $|a_n|$ for $a_n = (-1)^n/n$.



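$$\sum_{j=1}^{n} \log\Bigl(1 - \frac{\sin^2(x/(2n+1))}{\sin^2(j\pi/(2n+1))}\Bigr) \to \sum_{j=1}^{\infty} \log\Bigl(1 - \frac{x^2}{j^2\pi^2}\Bigr),$$

i.e.,

$$\prod_{j=1}^{n} \Bigl(1 - \frac{\sin^2(x/(2n+1))}{\sin^2(j\pi/(2n+1))}\Bigr) \to \prod_{j=1}^{\infty} \Bigl(1 - \frac{x^2}{j^2\pi^2}\Bigr),$$

hence Euler's formula for $|x| < 2$.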



□ 



6.4 Series of Terms of Arbitrary Sign 

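In the case where the terms of the series $\sum_{j=0}^{\infty} a_j$ have arbitrary sign, it
is convenient to set

$$a_j^+ := \begin{cases} a_j & \text{if } a_j \ge 0, \\ 0 & \text{otherwise,} \end{cases} \qquad a_j^- := \begin{cases} -a_j & \text{if } a_j \le 0, \\ 0 & \text{otherwise.} \end{cases}$$

Trivially $a_j = a_j^+ - a_j^-$ for all $j \ge 0$, hence $\sum_{j=0}^{n} a_j = \sum_{j=0}^{n} a_j^+ - \sum_{j=0}^{n} a_j^-$.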

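6.37 Proposition. We have

o $\sum_{j=0}^{\infty} a_j$ converges if both $\sum_{j=0}^{\infty} a_j^+$ and $\sum_{j=0}^{\infty} a_j^-$ converge,

o $\sum_{j=0}^{\infty} a_j$ diverges to $+\infty$ if $\sum_{j=0}^{\infty} a_j^+$ diverges to $+\infty$ and $\sum_{j=0}^{\infty} a_j^-$ converges,

o $\sum_{j=0}^{\infty} a_j$ diverges to $-\infty$ if $\sum_{j=0}^{\infty} a_j^+$ converges and $\sum_{j=0}^{\infty} a_j^-$ diverges to $+\infty$.

This way the study of the convergence of a generic series of real terms
is reduced to that of series with nonnegative terms, except in the case
where both $\sum_{j=0}^{\infty} a_j^+$ and $\sum_{j=0}^{\infty} a_j^-$ are divergent.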

a. Absolute convergence 

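6.38 Definition. We say that $\sum_{j=0}^{\infty} a_j$ converges absolutely if the series
$\sum_{j=0}^{\infty} |a_j|$ converges.

6.39 Proposition. If $\sum_{j=0}^{\infty} a_j$ converges absolutely, then it converges and

$$\Bigl| \sum_{j=0}^{\infty} a_j \Bigr| \le \sum_{j=0}^{\infty} |a_j|. \tag{6.31}$$

Proof. We prove that $\sum_{j=0}^{\infty} |a_j|$ converges if and only if both $\sum_{j=0}^{\infty} a_j^+$
and $\sum_{j=0}^{\infty} a_j^-$ converge. In fact, for $j \ge 0$, $a_j^+$ and $a_j^-$ are nonnegative and
$a_j^+ + a_j^- = |a_j|$, hence $a_j^+, a_j^- \le |a_j|$. The comparison test yields that the
series $\sum_{j=0}^{\infty} a_j^+$ and $\sum_{j=0}^{\infty} a_j^-$ converge, consequently $\sum_{j=0}^{\infty} a_j$ converges.

By the triangle inequality, we get

$$\Bigl| \sum_{j=0}^{n} a_j \Bigr| \le \sum_{j=0}^{n} |a_j| \le \sum_{j=0}^{\infty} |a_j| \qquad \forall n \ge 0,$$

therefore we deduce (6.31) passing to the limit as $n \to \infty$. □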

b. Series of complex terms 

The notion of sum of a series easily extends to series of complex terms.
We say that $\sum_{n=0}^{\infty} z_n$ converges (respectively diverges) if the partial sums
have finite (respectively infinite) limit. The sum is then defined by

$$\sum_{n=0}^{\infty} z_n := \lim_{n \to \infty} \sum_{j=0}^{n} z_j.$$

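6.40 Example (Geometric series). For $z \in \mathbb{C}$, $z \ne 1$, we still have

$$\sum_{j=0}^{n} z^j = \frac{1 - z^{n+1}}{1 - z},$$

therefore $\sum_{n=0}^{\infty} z^n$ converges for $|z| < 1$ and

$$\sum_{n=0}^{\infty} z^n = \frac{1}{1 - z} \qquad \text{for } |z| < 1.$$

If $|z| > 1$, since

$$|z^{n+1} - 1| \ge |z|^{n+1} - 1 \to +\infty \qquad \text{as } n \to \infty,$$

the series $\sum_{n=0}^{\infty} z^n$ diverges. Clearly $\sum_{n=0}^{\infty} z^n$ does not converge if $z = 1$, since $\sum_{j=0}^{n} 1 = n + 1$. Finally, it can be
proved, see Theorem 8.61, that $\sum_{n=0}^{\infty} z^n$ does not converge for any $z$ in the unit
circle $\{z \mid |z| = 1\} \subset \mathbb{C}$, thus concluding that $\sum_{n=0}^{\infty} z^n$ converges if and only if $|z| < 1$.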
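As an illustration of Example 6.40, the following Python sketch sums the geometric series term by term for a complex ratio and compares the result with the closed forms. The helper name `geometric_partial_sum` and the sample values of $z$ are assumptions made for the example only.

```python
def geometric_partial_sum(z, n):
    """Return sum_{j=0}^{n} z**j, accumulated term by term."""
    total = 0j
    power = 1 + 0j          # current power z**j, starting with j = 0
    for _ in range(n + 1):
        total += power
        power *= z
    return total

z_inside = 0.5 + 0.5j       # |z| < 1: the partial sums approach 1/(1 - z)
z_outside = 1.1j            # |z| > 1: the partial sums blow up in modulus

limit_inside = 1 / (1 - z_inside)
approx_inside = geometric_partial_sum(z_inside, 200)
finite_closed_form = (1 - z_inside ** 11) / (1 - z_inside)
modulus_outside = abs(geometric_partial_sum(z_outside, 200))
```

Here `approx_inside` is indistinguishable from `limit_inside` in double precision, while `modulus_outside` is already of the order of $10^8$.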

6.41 ¶. Show that $\sum_{n=0}^{\infty} z^n$ does not converge if $|z| = 1$. [Hint: Assuming $z \ne 1$, let
$z := e^{i\theta}$, $\theta \ne 2k\pi$, $k \in \mathbb{Z}$. Show one of the following claims:

o $e^{in\theta}$ does not have limit as $n \to \infty$, see Exercise 2.97.

o If $\theta/(2\pi) = p/q$, $p, q$ coprime integers, then $\{e^{in\theta}\}$ has $q$ values.

o If $\theta/(2\pi)$ is irrational, then $\{e^{in\theta}\}$ is dense on the unit circle (see Theorem 8.61).

Thus $e^{in\theta}$ has a limit iff $\theta/(2\pi)$ is an integer.]

Similarly, we have, see Example 6.5, that $\sum_{n=0}^{\infty} n z^n$ converges if and only if $|z| < 1$

with sum $\sum_{n=0}^{\infty} n z^n = \dfrac{z}{(1 - z)^2}$, $|z| < 1$.

Again, we trivially have

6.42 Proposition. If $\sum_{n=0}^{\infty} z_n$ converges, then $|z_n| \to 0$.

Cauchy convergence criterion, Theorem 4.23, yields

6.43 Proposition. The series $\sum_{n=0}^{\infty} z_n$, $z_n \in \mathbb{C}$, converges if and only if
the sequence of its partial sums is a Cauchy sequence, i.e., iff $\forall \epsilon > 0$ there
exists $n$ such that $\bigl| \sum_{j=p}^{q} z_j \bigr| < \epsilon$ for all $q \ge p \ge n$.

6.44 Definition. We say that $\sum_{n=0}^{\infty} z_n$, $z_n \in \mathbb{C}$, converges absolutely if
the series of nonnegative terms $\sum_{n=0}^{\infty} |z_n|$ converges.

6.45 Proposition. If the series $\sum_{n=0}^{\infty} z_n$ converges absolutely, then it
converges; moreover $\bigl| \sum_{n=p}^{\infty} z_n \bigr| \le \sum_{n=p}^{\infty} |z_n|$ for all $p$.

6.46 ¶. Prove Proposition 6.45. [Hint: Use (4.12) or Proposition 6.43.]



6.5 Series of Products 

In this section we illustrate a few results concerning series of products of
complex numbers

$$\sum_{j=0}^{\infty} a_j b_j.$$

In fact, the product structure of the terms helps in giving further results
of convergence.

a. Alternating series

6.47 Definition. An alternating series is a series of the type $\sum_{j=0}^{\infty} (-1)^j a_j$
with $a_j \ge 0$ for all $j \ge 0$.

6.48 Theorem (Leibniz test). Let $\sum_{j=0}^{\infty} (-1)^j a_j$ be an alternating
series. If $\{a_n\}$ is decreasing to zero, then $\sum_{j=0}^{\infty} (-1)^j a_j$ converges
and the errors between the sum and the partial sums are estimated by

$$a_p - a_{p+1} \le \Bigl| \sum_{j=p}^{\infty} (-1)^j a_j \Bigr| \le a_p \qquad \forall p \ge 0.$$

The inequalities are strict if $\{a_n\}$ is strictly decreasing.

Figure 6.18. (Plot of the partial sums $s_n$ against $n$; marked levels: $a_0$, $L$, $a_0 - a_1$.)

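Proof. Let $p < q \in \mathbb{N}$. It is easily seen using the assumptions that

$$a_p - a_{p+1} \le \Bigl| \sum_{j=p}^{q} (-1)^j a_j \Bigr| \le a_p + a_q. \tag{6.32}$$

Given $\epsilon > 0$, we can find $n$ such that $|a_p| < \epsilon$ for all $p \ge n$; (6.32) then
yields

$$\Bigl| \sum_{j=p}^{q} (-1)^j a_j \Bigr| < 2\epsilon \qquad \text{for all } q > p \ge n,$$

i.e., the sequence of partial sums of $\sum_{j=0}^{\infty} (-1)^j a_j$ is a Cauchy sequence,
therefore convergent. The estimate easily follows from (6.32) letting $q$ tend
to infinity. □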

An alternative proof of Theorem 6.48. For $n \in \mathbb{N}$ set $s_n := \sum_{j=0}^{n} (-1)^j a_j$.

(i) The subsequence $s_{2n}$, $n \ge 0$, of $\{s_n\}$, is decreasing and bounded below: in fact,

$$s_{2n+2} = s_{2n} - a_{2n+1} + a_{2n+2} \le s_{2n}, \qquad \forall n \ge 0,$$

$$s_{2n} = a_0 - a_1 + (a_2 - a_3) + \cdots + (a_{2n-2} - a_{2n-1}) + a_{2n} \ge a_0 - a_1,$$

since $\{a_n\}$ is decreasing.

(ii) The subsequence $s_{2n+1}$, $n \ge 0$, of $\{s_n\}$, is increasing and bounded above: in fact,

$$s_{2n+1} = s_{2n-1} + a_{2n} - a_{2n+1} \ge s_{2n-1}, \qquad \forall n \ge 1,$$

$$s_{2n+1} = a_0 - (a_1 - a_2) - (a_3 - a_4) - \cdots - (a_{2n-1} - a_{2n}) - a_{2n+1} \le a_0,$$

since $\{a_n\}$ is decreasing.

(iii) Finally

$$s_{2n+1} = s_{2n} - a_{2n+1} \le s_{2n}, \qquad \forall n,$$

$$s_{2n+1} - s_{2n} = -a_{2n+1} \to 0 \qquad \text{for } n \to \infty.$$

By (i) and (ii)

$$s_{2n} \to L \in \mathbb{R}, \qquad s_{2n+1} \to M \in \mathbb{R},$$

and by (iii)

$$s_{2n} - s_{2n+1} \to L - M = 0,$$

i.e., $s_{2n}$ and $s_{2n+1}$ have the same limit $L$. Since the indices of $s_{2n}$ (the even integers)
and the indices of $s_{2n+1}$ (the odd integers) exhaust all integers, we then conclude that
$s_n \to L$. Also

$$s_{2n+1} \le L \le s_{2n}, \qquad \forall n \ge 0, \tag{6.33}$$

hence

$$a_{2n+1} - a_{2n+2} \le s_{2n} - L \le s_{2n} - s_{2n+1} = a_{2n+1},$$
$$a_{2n+2} - a_{2n+3} \le L - s_{2n+1} \le s_{2n+2} - s_{2n+1} = a_{2n+2},$$

that is,

$$\Bigl| \sum_{j=n}^{\infty} (-1)^j a_j \Bigr| = |L - s_{n-1}| \le a_n \qquad \text{for all } n \in \mathbb{N}.$$

□
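The two-sided error estimate can be tested numerically on the alternating harmonic series $\sum_{j \ge 0} (-1)^j/(j+1)$, whose sum is $\log 2$. The Python sketch below is an illustration only; the names `a`, `alternating_partial_sum` and `leibniz_bounds` are hypothetical.

```python
import math

def a(j):
    """Terms a_j = 1/(j+1): positive and strictly decreasing to zero."""
    return 1.0 / (j + 1)

def alternating_partial_sum(n):
    """s_n = sum_{j=0}^{n} (-1)**j * a(j); the empty sum s_{-1} is 0."""
    return sum((-1) ** j * a(j) for j in range(n + 1))

def leibniz_bounds(n):
    """Return (lower, tail, upper) with tail = |L - s_{n-1}| = |sum_{j>=n} (-1)^j a_j|."""
    limit = math.log(2.0)               # known sum of the alternating harmonic series
    tail = abs(limit - alternating_partial_sum(n - 1))
    return a(n) - a(n + 1), tail, a(n)
```

For instance `leibniz_bounds(1)` returns approximately `(0.1667, 0.3069, 0.5)`, in agreement with $a_1 - a_2 \le |L - s_0| \le a_1$.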



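6.49 Remark. We notice that there exist sequences $a_n \ge 0$, $a_n \to 0$, for
which $\sum_{n} (-1)^n a_n$ does not converge, i.e., we cannot omit the assumption
that $\{a_n\}$ is decreasing. An example is given by the series of alternating
terms

$$\sum_{n=1}^{\infty} (-1)^n a_n, \qquad a_n := \frac{1}{\sqrt{n}} + \frac{(-1)^n}{n},$$

whose partial sums are given by

$$\sum_{j=1}^{n} (-1)^j a_j = \sum_{j=1}^{n} \frac{(-1)^j}{\sqrt{j}} + \sum_{j=1}^{n} \frac{1}{j}.$$

The series $\sum_{j=1}^{\infty} (-1)^j/\sqrt{j}$ converges by the Leibniz test, while $\sum_{j=1}^{\infty} 1/j$
diverges. Therefore

$$\sum_{j=1}^{\infty} (-1)^j a_j = +\infty.$$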



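6.50 Remark. The example in Remark 6.49 shows also that the asymptotic
comparison test is not valid for series of terms of arbitrary sign. In
fact,

$$\frac{\dfrac{(-1)^n}{\sqrt{n}} + \dfrac{1}{n}}{\dfrac{(-1)^n}{\sqrt{n}}} \to 1 \qquad \text{as } n \to \infty$$

and the series $\sum_{n=1}^{\infty} (-1)^n/\sqrt{n}$ converges, but $\sum_{n=1}^{\infty} (-1)^n a_n$ diverges.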







b. Summation by parts 

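6.51 Proposition (Summation by parts). Let $\{a_n\}$, $\{b_n\}$ be two sequences
in $\mathbb{C}$ and let $B_n := \sum_{j=0}^{n} b_j$, $n \ge 0$, $B_{-1} = 0$. For arbitrary
$p, q \in \mathbb{N}$, $0 \le p < q$, we have

$$\sum_{j=p}^{q} a_j b_j = \sum_{j=p}^{q-1} (B_j - B_{p-1})(a_j - a_{j+1}) + a_q (B_q - B_{p-1}). \tag{6.34}$$

In particular

$$\Bigl| \sum_{j=p}^{q} a_j b_j \Bigr| \le \sup_{p \le j \le q} |B_j - B_{p-1}| \Bigl( \sum_{j=p}^{q-1} |a_j - a_{j+1}| + |a_q| \Bigr). \tag{6.35}$$

Proof. Set $C_j := B_j - B_{p-1}$, so that $C_{p-1} = 0$, and $b_j = C_j - C_{j-1}$. Then

$$\sum_{j=p}^{q} a_j b_j = \sum_{j=p}^{q} a_j (C_j - C_{j-1}) = \sum_{j=p}^{q} a_j C_j - \sum_{j=p-1}^{q-1} a_{j+1} C_j$$

$$= a_q C_q + \sum_{j=p}^{q-1} C_j (a_j - a_{j+1}) - a_p C_{p-1} = a_q C_q + \sum_{j=p}^{q-1} C_j (a_j - a_{j+1}),$$

that is (6.34). By (6.34) and the triangle inequality, we finally infer

$$\Bigl| \sum_{j=p}^{q} a_j b_j \Bigr| = \Bigl| \sum_{j=p}^{q-1} (B_j - B_{p-1})(a_j - a_{j+1}) + a_q (B_q - B_{p-1}) \Bigr|$$

$$\le \sum_{j=p}^{q-1} |B_j - B_{p-1}| \, |a_j - a_{j+1}| + |a_q| \, |B_q - B_{p-1}|$$

$$\le \sup_{p \le j \le q} |B_j - B_{p-1}| \Bigl( \sum_{j=p}^{q-1} |a_j - a_{j+1}| + |a_q| \Bigr).$$

□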
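Formula (6.34) is a purely algebraic identity, so it can be checked on arbitrary finite data. The sketch below, in Python, compares both sides for random complex sequences; the function names are hypothetical and the random data serve only as a test case.

```python
import random

def partial_sums(b):
    """Return [B_0, B_1, ...] with B_n = b_0 + ... + b_n."""
    out, running = [], 0
    for x in b:
        running += x
        out.append(running)
    return out

def direct_sum(a, b, p, q):
    """Left-hand side of (6.34): sum_{j=p}^{q} a_j b_j."""
    return sum(a[j] * b[j] for j in range(p, q + 1))

def summed_by_parts(a, b, p, q):
    """Right-hand side of (6.34), using the convention B_{-1} = 0."""
    B = partial_sums(b)
    B_prev = B[p - 1] if p >= 1 else 0
    total = sum((B[j] - B_prev) * (a[j] - a[j + 1]) for j in range(p, q))
    return total + a[q] * (B[q] - B_prev)

random.seed(0)
a = [complex(random.uniform(-1, 1), random.uniform(-1, 1)) for _ in range(30)]
b = [complex(random.uniform(-1, 1), random.uniform(-1, 1)) for _ in range(30)]
```

Both functions agree up to rounding for every admissible pair $p \le q$, for example `direct_sum(a, b, 3, 15)` and `summed_by_parts(a, b, 3, 15)`.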



c. Sequences of bounded total variation 

6.52 Definition. We say that the sequence $\{a_n\} \subset \mathbb{C}$ has bounded total
variation if $\sum_{j=0}^{\infty} |a_j - a_{j+1}|$ converges.

6.53 Example. Let $\sum_{j=0}^{\infty} a_j$ converge absolutely. Then $\{a_n\}$ has bounded total
variation since for any $p \ge 0$ we have $\sum_{j=0}^{p} |a_j - a_{j+1}| \le \sum_{j=0}^{p} (|a_j| + |a_{j+1}|) \le 2 \sum_{j=0}^{\infty} |a_j|$.

Notice that the sequence $\{(-1)^n/n\}$, which converges to zero, does not have
bounded total variation since

$$\sum_{j=1}^{\infty} \Bigl| \frac{(-1)^j}{j} - \frac{(-1)^{j+1}}{j+1} \Bigr| = \sum_{j=1}^{\infty} \Bigl( \frac{1}{j} + \frac{1}{j+1} \Bigr) = +\infty.$$

The following proposition collects a few facts concerning sequences with
bounded total variation.

6.54 Proposition. We have

(i) If $\{a_n\} \subset \mathbb{C}$ has bounded total variation, then $\{a_n\}$ converges, $a_n \to \ell$, and

$$|a_p - \ell| \le \sum_{j=p}^{\infty} |a_j - a_{j+1}|, \qquad \forall p \ge 0.$$

(ii) Every real, monotone and bounded sequence $\{a_n\}$ has bounded total
variation and $\sum_{j=0}^{\infty} |a_j - a_{j+1}| = |a_0 - \lim_{j \to \infty} a_j|$.

Proof. (i) If $\sum_{j=0}^{\infty} |a_j - a_{j+1}|$ converges, then the sequence $\sum_{j=0}^{k} |a_j - a_{j+1}|$, $k \in \mathbb{N}$,
is a Cauchy sequence, i.e., $\forall \epsilon > 0$ $\exists\, \bar{p}$ such that

$$\sum_{j=p}^{q-1} |a_j - a_{j+1}| < \epsilon$$

for $q > p \ge \bar{p}$. From

$$|a_q - a_p| = \Bigl| \sum_{j=p}^{q-1} (a_j - a_{j+1}) \Bigr| \le \sum_{j=p}^{q-1} |a_j - a_{j+1}|, \tag{6.36}$$

which holds for $p < q$, we then infer $|a_p - a_q| < \epsilon$ for $p, q \ge \bar{p}$, i.e., $\{a_n\}$ is a Cauchy sequence,
hence $a_n \to \ell$. The estimate follows letting $q \to \infty$ in (6.36).

(ii) For instance, assume $\{a_n\}$ is increasing. Then

$$\sum_{j=0}^{n} |a_j - a_{j+1}| = \sum_{j=0}^{n} (a_{j+1} - a_j) = a_{n+1} - a_0,$$

and the conclusion follows when $n \to \infty$.

□

Figure 6.19. Lejeune Dirichlet (1805-1859) and Niels Henrik Abel (1802-1829).



d. Dirichlet and Abel theorems 

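6.55 Theorem (Dirichlet). Let $\{a_n\}$ and $\{b_n\}$ be two complex sequences.
Suppose that

(i) $\{a_n\}$ has bounded total variation and $a_n \to 0$,

(ii) the partial sums of $\{b_n\}$, $B_n := \sum_{j=0}^{n} b_j$, are bounded, $|B_n| \le M \in \mathbb{R}$
$\forall n \ge 0$.

Then $\sum_{j=0}^{\infty} a_j b_j$ converges and

$$\Bigl| \sum_{j=p}^{\infty} a_j b_j \Bigr| \le 2M \sum_{j=p}^{\infty} |a_j - a_{j+1}|$$

for all $p \in \mathbb{N}$.

Proof. Given $\epsilon > 0$, from the assumptions on $\{a_n\}$ we infer that there
exists $\bar{p}$ such that

$$\sum_{j=p}^{q-1} |a_j - a_{j+1}| + |a_q| < \epsilon$$

for all $p, q$ with $q > p \ge \bar{p}$. This, together with (6.35) and the assumption on
$\{b_n\}$ yields

$$\Bigl| \sum_{j=p}^{q} a_j b_j \Bigr| \le \sup_{p \le j \le q} |B_j - B_{p-1}| \, \epsilon \le 2M \epsilon,$$

that is, the sequence of partial sums of $\sum_{j=0}^{\infty} a_j b_j$ is a Cauchy sequence, hence converges. Letting $q \to \infty$
in (6.35) we finally get the estimate. □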
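A classical instance of the Dirichlet test is the series $\sum_{m \ge 1} \sin(mt)/m$, $0 < t < 2\pi$. The Python sketch below is an illustration under the following assumptions, none of which is taken from the text: we use $a_j = 1/(j+1)$ and $b_j = \sin((j+1)t)$, the standard bound $M = 1/|\sin(t/2)|$ for the partial sums of $b_j$, and the known value $(\pi - t)/2$ of the sum. Since $a_j$ decreases to zero, $\sum_{j \ge p} |a_j - a_{j+1}| = a_p$, so the tail should not exceed $2M a_p$.

```python
import math

def sine_partial_sum(t, p):
    """sum_{m=1}^{p} sin(m t)/m, i.e. sum_{j=0}^{p-1} a_j b_j with a_j = 1/(j+1), b_j = sin((j+1)t)."""
    return sum(math.sin(m * t) / m for m in range(1, p + 1))

def tail(t, p):
    """sum_{j>=p} a_j b_j, obtained from the known sum (pi - t)/2 for 0 < t < 2*pi."""
    return (math.pi - t) / 2 - sine_partial_sum(t, p)

def dirichlet_bound(t, p):
    """2 M sum_{j>=p} |a_j - a_{j+1}| = 2 M a_p, with M = 1/|sin(t/2)|."""
    M = 1.0 / abs(math.sin(t / 2))
    return 2.0 * M / (p + 1)

def max_partial_sum_of_b(t, n_max):
    """Largest |B_n| for n < n_max, where B_n = sum_{k=1}^{n+1} sin(k t)."""
    running, worst = 0.0, 0.0
    for k in range(1, n_max + 1):
        running += math.sin(k * t)
        worst = max(worst, abs(running))
    return worst
```

The bound is far from sharp, but it shows the expected decay of order $1/p$ of the tails.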



6.56 Remark. Notice that the Dirichlet test, Theorem 6.55, is an extension
of the Leibniz test for alternating series.

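6.57 Theorem (Abel). Let $\{a_n\}$ and $\{b_n\}$ be two complex sequences.
Suppose that

(i) $\{a_n\}$ has bounded total variation,

(ii) $\sum_{j=0}^{\infty} b_j$ converges.

Then $\sum_{j=0}^{\infty} a_j b_j$ converges and

$$\Bigl| \sum_{j=p}^{\infty} a_j b_j \Bigr| \le \sup_{j \ge p} |B_j - B_{p-1}| \Bigl( 2 \sum_{j=p}^{\infty} |a_j - a_{j+1}| + |a_p| \Bigr)$$

for all $p \ge 1$.

Proof. By (ii) $B_n := \sum_{j=0}^{n} b_j$ is a Cauchy sequence: $\forall \epsilon > 0$ $\exists\, \bar{p} \in \mathbb{N}$ such
that $|B_j - B_{p-1}| < \epsilon$ for all $j \ge p \ge \bar{p}$. By (i) and by (i) of Proposition 6.54, $\{a_n\}$
is convergent, hence bounded, $|a_n| \le M \in \mathbb{R}$ $\forall n \ge 0$. Therefore we deduce
from (6.35)

$$\Bigl| \sum_{j=p}^{q} a_j b_j \Bigr| \le \sup_{p \le j \le q} |B_j - B_{p-1}| \Bigl( \sum_{j=p}^{q-1} |a_j - a_{j+1}| + |a_q| \Bigr). \tag{6.37}$$

In particular the sequence of partial sums of $\sum_{j=0}^{\infty} a_j b_j$ is a Cauchy sequence,
hence converges. Finally the estimate follows letting $q \to \infty$ in
(6.37), taking into account

$$|a_q| \le |a_p| + \sum_{j=p}^{q-1} |a_j - a_{j+1}|.$$

□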



6.6 Products of Series 

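Let $P(x) = \sum_{j=0}^{p} a_j x^j$ and $Q(x) = \sum_{j=0}^{q} b_j x^j$ be two polynomials. Recall
that

$$P(x) Q(x) = \sum_{k=0}^{p+q} \Bigl( \sum_{i+j=k} a_i b_j \Bigr) x^k, \tag{6.38}$$

where we have set $a_i = b_j = 0$ for all $i, j$ such that $p < i \le p + q$ and
$q < j \le p + q$.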

6.58 Definition. Given two sequences $a := \{a_n\}$ and $b := \{b_n\}$, the
product of convolution of $a$ and $b$, denoted $a * b$, is the sequence defined as

$$(a * b)_n := \sum_{i+j=n} a_i b_j = \sum_{j=0}^{n} a_j b_{n-j}.$$

6.59 Example. The product of convolution is extremely useful in operating with
sequences. We give a few examples. If $\delta_{k,n}$ is Kronecker's symbol

$$\delta_{k,n} := \begin{cases} 1 & \text{if } k = n, \\ 0 & \text{if } k \ne n, \end{cases}$$

and $e_k$ is the sequence defined by

$$e_k := \{\delta_{k,n}\}_n = \{0, 0, \dots, 0, 1, 0, 0, \dots\},$$

we have

$$(a * e_k)_n = \begin{cases} 0 & \text{if } n < k, \\ a_{n-k} & \text{if } n \ge k, \end{cases}$$

that is, the values of $a * e_k$ are the values of $\{a_n\}$ shifted $k$ positions to the right,

$$a = \{a_0, a_1, a_2, \dots, a_n, \dots\},$$
$$a * e_k = \{0, 0, \dots, 0, a_0, a_1, a_2, \dots, a_n, \dots\}.$$

Similarly, if $b = \{1/2, 1/2, 0, 0, 0, \dots\}$, then

$$(a * b)_n = \begin{cases} a_0/2 & \text{if } n = 0, \\ (a_n + a_{n-1})/2 & \text{if } n \ge 1. \end{cases}$$

If $b = \{1, 1, 1, \dots, 1, \dots\}$, then

$$(a * b)_n = \sum_{k=0}^{n} a_k \qquad \forall n.$$
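The three examples above, as well as the product of two polynomials, can be reproduced with a few lines of Python. This is a minimal sketch for finite sequences, which are implicitly extended by zeros; the function name `convolve` is an assumption made for the illustration.

```python
def convolve(a, b):
    """Product of convolution of two finite sequences: (a*b)_n = sum_{i+j=n} a_i b_j."""
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

a = [1, 2, 3, 4]

shifted = convolve(a, [0, 0, 1])         # a * e_2: values of a shifted two places to the right
averaged = convolve(a, [0.5, 0.5])       # (a_n + a_{n-1})/2, with a_n = 0 beyond the data
running = convolve(a, [1, 1, 1, 1])[:4]  # partial sums a_0 + ... + a_n for n = 0, ..., 3
square = convolve([1, 1], [1, 1])        # coefficients of (1 + x)^2
```

Because the input is finite, the last entry of `averaged` is $a_3/2$: it corresponds to $(a_4 + a_3)/2$ with $a_4 = 0$.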

In terms of product of convolution, (6.38) can be restated as: the coefficients
of $P(x)Q(x)$ are the product of convolution of the coefficients of
$P(x)$ and $Q(x)$, or, better, of the sequences

$$a := \{a_0, a_1, a_2, \dots, a_p, 0, 0, 0, \dots\},$$
$$b := \{b_0, b_1, b_2, \dots, b_q, 0, 0, 0, \dots\},$$

$$P(x) Q(x) = \sum_{k=0}^{p+q} (a * b)_k x^k.$$

More generally, we have the following. 

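6.60 Theorem. Let $\sum_{j=0}^{\infty} a_j$ and $\sum_{j=0}^{\infty} b_j$ be absolutely convergent. Then
$\sum_{j=0}^{\infty} (a * b)_j$ is absolutely convergent and

$$\sum_{j=0}^{\infty} (a * b)_j = \Bigl( \sum_{j=0}^{\infty} a_j \Bigr) \Bigl( \sum_{j=0}^{\infty} b_j \Bigr).$$

Figure 6.20. (Diagram in the $(i, j)$ plane; marked values: $[p/2]$, $p$, $q$.)

Proof. Set $A := \sum_{j=0}^{\infty} |a_j|$ and $B := \sum_{j=0}^{\infty} |b_j|$. For $q \ge p$ and $n := [p/2]$ we have

$$\sum_{k=p}^{q} |(a * b)_k| \le \sum_{k=p}^{q} \sum_{i+j=k} |a_i| \, |b_j| = \sum_{p \le i+j \le q} |a_i| \, |b_j| \tag{6.39}$$

$$\le \sum_{i=n}^{q} |a_i| \sum_{j=0}^{q} |b_j| + \sum_{j=n}^{q} |b_j| \sum_{i=0}^{q} |a_i| \le B \sum_{i=n}^{q} |a_i| + A \sum_{j=n}^{q} |b_j|.$$

Given $\epsilon > 0$ we find $\bar{p}$ such that $\sum_{i=n}^{q} |a_i| < \epsilon$ and $\sum_{j=n}^{q} |b_j| < \epsilon$ for all $q \ge n \ge \bar{p}$,
consequently, on account of (6.39),

$$\sum_{j=p}^{q} |(a * b)_j| \le (A + B)\epsilon \qquad \text{for all } q \ge p \ge 2\bar{p}.$$

Therefore the sequence of the partial sums of $\sum_{j=0}^{\infty} |(a * b)_j|$ is a Cauchy sequence,
hence converges, that is, $\sum_{j=0}^{\infty} (a * b)_j$ converges absolutely.

For $p \ge 0$ we also have, similarly to (6.39),

$$\Bigl| \sum_{k=0}^{p} \sum_{i+j=k} a_i b_j - \sum_{j=0}^{p} a_j \sum_{j=0}^{p} b_j \Bigr| = \Bigl| \sum_{\substack{i \le p,\ j \le p \\ i+j > p}} a_i b_j \Bigr|$$

$$\le \sum_{p < i+j \le 2p} |a_i| \, |b_j| \le A \sum_{j=n}^{\infty} |b_j| + B \sum_{j=n}^{\infty} |a_j|$$

where $n := [p/2]$. Passing to the limit as $p \to \infty$, we get the result.

□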
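A standard test case for the theorem is the exponential series: with $a_n = x^n/n!$ and $b_n = y^n/n!$ the binomial formula gives $(a * b)_n = (x + y)^n/n!$, so the theorem reduces to $e^x e^y = e^{x+y}$. The Python sketch below checks this numerically; the function names and the sample values of $x$ and $y$ are assumptions made for the illustration.

```python
import cmath
import math

def exp_terms(z, n_terms):
    """First n_terms terms z**n / n! of the exponential series."""
    terms, t = [], 1 + 0j
    for n in range(n_terms):
        terms.append(t)
        t *= z / (n + 1)
    return terms

def cauchy_product(a, b):
    """(a*b)_k = sum_{j=0}^{k} a_j b_{k-j} for k below the common length."""
    n = min(len(a), len(b))
    return [sum(a[j] * b[k - j] for j in range(k + 1)) for k in range(n)]

x, y = 1.5, -0.7 + 2j
c = cauchy_product(exp_terms(x, 40), exp_terms(y, 40))
sum_of_product_series = sum(c)
product_of_sums = cmath.exp(x) * cmath.exp(y)
```

Forty terms are more than enough here, since the tails of both series are far below double precision.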



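6.61 Remark. There seems to be no known necessary and sufficient condition
for the convergence of the series of products. However, one can show

(i) Abel. If $\sum_{j=0}^{\infty} a_j$, $\sum_{j=0}^{\infty} b_j$ and $\sum_{j=0}^{\infty} (a * b)_j$ converge, then

$$\sum_{j=0}^{\infty} (a * b)_j = \Bigl( \sum_{j=0}^{\infty} a_j \Bigr) \Bigl( \sum_{j=0}^{\infty} b_j \Bigr),$$

see Theorem 7.33.

(ii) Mertens. If $\sum_{j=0}^{\infty} a_j$ converges and $\sum_{j=0}^{\infty} b_j$ is absolutely convergent,
then $\sum_{j=0}^{\infty} (a * b)_j$ converges and, by (i), we have $\sum_{j=0}^{\infty} (a * b)_j = \bigl( \sum_{j=0}^{\infty} a_j \bigr) \bigl( \sum_{j=0}^{\infty} b_j \bigr)$.

(iii) Hardy. If $\sum_{j=0}^{\infty} a_j$ and $\sum_{j=0}^{\infty} b_j$ converge and the sequences $\{n a_n\}$
and $\{n b_n\}$ are bounded, then $\sum_{j=0}^{\infty} (a * b)_j$ converges.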



6.7 Rearrangements 

Given an ordered enumeration of numbers, we defined their sum in the
previous sections. This notion is useful to define the sum of a denumerable
set of numbers, but a priori such a sum depends on the order in which
they are listed.

A sequence $\{b_n\}$ is a rearrangement of $\{a_n\}$ if it contains the same
elements as $\{a_n\}$ listed in a different order. More precisely

6.62 Definition. We say that $\{b_n\}$ is a rearrangement of $\{a_n\}$ if there
is a bijective map $k : \mathbb{N} \to \mathbb{N}$ such that

$$b_n = a_{k_n}.$$

We say that $\sum_{n=0}^{\infty} b_n$ is a rearrangement of $\sum_{n=0}^{\infty} a_n$.



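6.63 Theorem (Dirichlet). Suppose that $\sum_{n=0}^{\infty} a_n^+$ and $\sum_{n=0}^{\infty} a_n^-$ are
not both divergent. Then every rearrangement

$$\sum_{n=0}^{\infty} b_n$$

of $\sum_{n=0}^{\infty} a_n$ has the same sum as $\sum_{n=0}^{\infty} a_n$,

$$\sum_{n=0}^{\infty} b_n = \sum_{n=0}^{\infty} a_n = \sum_{n=0}^{\infty} a_n^+ - \sum_{n=0}^{\infty} a_n^-.$$

In particular, in the case of series of nonnegative terms or of absolutely
convergent series, the sum is independent of the order of the addends.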



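Proof. It suffices to prove the theorem in the case of series of nonnegative terms. Let
$k : \mathbb{N} \to \mathbb{N}$ be a map which reorders $\{a_n\}$, set $b_n := a_{k_n}$, and let $s_n$ and $\sigma_n$ be the $n$-th
partial sums respectively of $\sum_{j=0}^{\infty} a_j$ and $\sum_{j=0}^{\infty} b_j$. Since $a_n, b_n \ge 0$, we have

$$s_n \to S := \sum_{j=0}^{\infty} a_j, \qquad \sigma_n \to \Sigma := \sum_{j=0}^{\infty} b_j.$$

Since for every $n$, setting $m_n := \max\{k_0, k_1, \dots, k_n\}$,

$$\sigma_n = b_0 + b_1 + \cdots + b_n = a_{k_0} + a_{k_1} + \cdots + a_{k_n} \le a_0 + a_1 + \cdots + a_{m_n} \le S,$$

we deduce $\Sigma \le S$. Since $\sum_{j=0}^{\infty} a_j$ is in turn a rearrangement of $\sum_{j=0}^{\infty} b_j$, we also have
$S \le \Sigma$. In conclusion $S = \Sigma$. □

However, this is not true anymore if

$$\sum_{j=0}^{\infty} a_j^+ = \sum_{j=0}^{\infty} a_j^- = +\infty.$$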



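6.64 Example. Consider the series

$$\sum_{j=1}^{\infty} \Bigl( \frac{1}{4j - 3} + \frac{1}{4j - 1} - \frac{1}{2j} \Bigr) = \Bigl( 1 + \frac{1}{3} - \frac{1}{2} \Bigr) + \Bigl( \frac{1}{5} + \frac{1}{7} - \frac{1}{4} \Bigr) + \cdots$$

of positive terms, which is convergent since

$$\sigma_j := \frac{1}{4j - 3} + \frac{1}{4j - 1} - \frac{1}{2j} = \frac{8j - 3}{2j(4j - 3)(4j - 1)} \le \frac{2}{j^2}.$$

Notice also that

$$\frac{11}{12} < \sigma_1 + \sigma_2 < \sum_{j=1}^{\infty} \sigma_j.$$

Removing the parentheses we get

$$\sum_{j=0}^{\infty} b_j = 1 + \frac{1}{3} - \frac{1}{2} + \frac{1}{5} + \frac{1}{7} - \frac{1}{4} + \cdots,$$

which is a rearrangement of the series

$$\sum_{j=0}^{\infty} a_j = 1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \cdots = \sum_{n=0}^{\infty} \frac{(-1)^n}{n + 1},$$

which converges by the Leibniz test to a number $L$ between $1 - 1/2 = 1/2$ and $1 - 1/2 +
1/3 = 5/6 < 11/12$. The sums of the two series $\sum_{j=0}^{\infty} a_j$ and $\sum_{j=0}^{\infty} b_j$ are therefore
different.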
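The two sums can be compared numerically. In the Python sketch below the function names are hypothetical; the closed values $\log 2$ for the alternating harmonic series and $\frac{3}{2}\log 2$ for this particular rearrangement are classical facts used only as reference values, not statements taken from the example.

```python
import math

def rearranged_sum(groups):
    """Sum of the first `groups` blocks 1/(4j-3) + 1/(4j-1) - 1/(2j), j = 1, 2, ..."""
    return sum(1.0 / (4 * j - 3) + 1.0 / (4 * j - 1) - 1.0 / (2 * j)
               for j in range(1, groups + 1))

def alternating_sum(terms):
    """Sum of the first `terms` terms of 1 - 1/2 + 1/3 - 1/4 + ..."""
    return sum((-1) ** n / (n + 1) for n in range(terms))

r = rearranged_sum(100000)      # two positive terms, then one negative term
s = alternating_sum(200000)     # the original order
```

The first value stays above $11/12$ and the second below $5/6$, as claimed in the example.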



Of course not all rearrangements change the sums. For instance, a sum 
does not change if we reorder only a finite number of terms. In general, 
however, we have 



6.65 Theorem (Dini-Riemann). Suppose that $\{a_n\}$ is a sequence which
converges to zero and for which

$$\sum_{j=0}^{\infty} a_j^+ = \sum_{j=0}^{\infty} a_j^- = +\infty.$$

Then

(i) for any $\ell \in \mathbb{R}$ there exists a rearrangement $\{b_n\}$ of $\{a_n\}$ such that $\sum_{j=0}^{\infty} b_j = \ell$,

(ii) there exists a rearrangement $\{b_n\}$ of $\{a_n\}$ such that $\sum_{j=0}^{\infty} b_j$ is
indeterminate.

We give an idea of how to define a rearrangement with sum $\ell$, $0 < \ell \in \mathbb{R}$.
We begin by adding in order the nonnegative terms $a_j^+$ until we exceed $\ell$
(this is possible since $\sum_{j=0}^{\infty} a_j^+ = +\infty$); at this point we start to add the
negative terms $-a_j^-$ until the sum falls below $\ell$ (and this is possible since
$\sum_{j=0}^{\infty} a_j^- = +\infty$); then we repeat the procedure. The partial sums of
the rearrangement constructed this way oscillate around $\ell$, and actually
converge to $\ell$ since $a_j \to 0$. In other words the idea of the proof is: if
one is allowed unlimited credits and debts and may freely defer takings
and payments, then one can decide the threshold of one's own richness or
poverty.
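The procedure just described is easy to simulate. The Python sketch below applies it to the alternating harmonic series $\sum_{n \ge 0} (-1)^n/(n+1)$, whose positive and negative parts both diverge; the function name `rearrange_to_target` and the chosen targets are assumptions made for the illustration.

```python
def rearrange_to_target(target, n_terms):
    """Greedy rearrangement of 1 - 1/2 + 1/3 - ... whose partial sums approach `target`.

    While the running sum does not exceed the target, the next unused positive
    term 1/(2k+1) is added; otherwise the next unused negative term -1/(2k+2).
    Returns the final partial sum and the list of terms in the order used.
    """
    pos = 0          # number of positive terms used so far
    neg = 0          # number of negative terms used so far
    total = 0.0
    order = []
    for _ in range(n_terms):
        if total <= target:
            term = 1.0 / (2 * pos + 1)
            pos += 1
        else:
            term = -1.0 / (2 * neg + 2)
            neg += 1
        total += term
        order.append(term)
    return total, order

total_one, order_one = rearrange_to_target(1.0, 50000)
total_two, order_two = rearrange_to_target(2.0, 50000)
```

Every term of the original series is eventually used exactly once, and after each crossing the distance from the target is at most the size of the last term added, which tends to zero.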

We conclude this section by stating a simple consequence of the above 
concerning double series. 

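6.66 Proposition. Given a double sequence $\{a_{ij}\}$, $i, j = 0, 1, 2, \dots$, suppose
that $\sum_{j=0}^{\infty} a_{ij}$ is absolutely convergent for every $i$ and that, setting

$$b_i := \sum_{j=0}^{\infty} |a_{ij}|, \qquad i = 0, 1, 2, \dots,$$

the series $\sum_{i=0}^{\infty} b_i$ converges. Then

$$\sum_{i=0}^{\infty} \sum_{j=0}^{\infty} a_{ij} = \sum_{j=0}^{\infty} \sum_{i=0}^{\infty} a_{ij}. \tag{6.40}$$

6.67 ¶. Prove Proposition 6.66. Show that (6.40) does not hold in general if we
only require that $\sum_{j=0}^{\infty} a_{ij}$ converges for every $i$ and that $\sum_{i=0}^{\infty} b_i$ converges, where this time $b_i := \sum_{j=0}^{\infty} a_{ij}$.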



6.8 Summing Up 

Definitions and basic facts 

Given a sequence $\{a_n\}$ of complex numbers, define for every $n \ge 0$ $s_n := \sum_{j=0}^{n} a_j$. The
sequence $\{s_n\}$ is called the series of partial sums of $\{a_n\}$ and denoted by $\sum_{j=0}^{\infty} a_j$. The
series is said to be convergent if $\{s_n\}$ has a finite limit, divergent if $\{s_n\}$ has an infinite
limit, and indeterminate if $\{s_n\}$ has no limit.

o If $\sum_{j=0}^{\infty} a_j$ converges, then $a_n \to 0$ as $n \to \infty$. The converse is false, in general.

o Given a series $\sum_{j=0}^{\infty} a_j$ of real terms, denote by $\varphi(x)$ the piecewise constant function
defined by

$$\varphi(x) = a_j \qquad \text{if } j \le x < j + 1.$$

Then trivially

$$\sum_{j=0}^{n} a_j = \int_0^{n+1} \varphi(x) \, dx \qquad \text{for all } n.$$

Thus partial sums of a series are integrals. This way, the comparison test for integrals
becomes a means to estimate the partial sums of a series. Moreover, $\sum_{j=0}^{\infty} a_j$ converges,
diverges, or is indeterminate if and only if $\int_0^x \varphi(s) \, ds$ respectively converges,
diverges or is indeterminate as $x \to \infty$.



Series with nonnegative terms 

In this case the sequence $\{s_n\}$ of the partial sums, $s_n := \sum_{j=0}^{n} a_j$, $a_j \ge 0$
$\forall j$, is monotonically increasing, hence $\sum_{j=0}^{\infty} a_j$ either converges or diverges. Consequently,
the comparison test, Proposition 6.21, and the asymptotic comparison test,
Proposition 6.24, hold.

The family of the generalized harmonic series

$$\sum_{j=1}^{\infty} \frac{1}{j^{\alpha}}$$

is useful when using the comparison tests. They converge if and only if $\alpha > 1$, and in
this case

$$\frac{1}{\alpha - 1} \le \sum_{j=1}^{\infty} \frac{1}{j^{\alpha}} \le 1 + \frac{1}{\alpha - 1}.$$

The harmonic series $\sum_{j=1}^{\infty} \frac{1}{j}$ diverges. Moreover its partial sums are asymptotic to
$\log n$, since we have

$$\log(1 + n) \le \sum_{j=1}^{n} \frac{1}{j} \le 1 + \log n.$$

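There are some other useful tests for convergence.

o Cauchy's test. Let $\{a_n\}$ be nonnegative and decreasing. Then $\sum_{j=1}^{\infty} a_j$ converges
if and only if $\sum_{j=0}^{\infty} 2^j a_{2^j}$ converges. In this case

$$\sum_{j=1}^{\infty} a_j \le \sum_{j=0}^{\infty} 2^j a_{2^j}.$$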



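o Root test. Let $\sum_{n=1}^{\infty} a_n$ be a series with nonnegative terms.

(i) Suppose that there exist $K < 1$ and $p \in \mathbb{N}$ such that $\sqrt[n]{a_n} \le K < 1$ for all $n \ge p$,
then $\sum_{n=1}^{\infty} a_n$ converges and

$$\sum_{n=p}^{\infty} a_n \le \frac{K^p}{1 - K}.$$

(ii) Suppose that there exist $K > 1$ and $p \in \mathbb{N}$ such that $\sqrt[n]{a_n} \ge K > 1$ for all $n \ge p$,
then $\sum_{n=1}^{\infty} a_n$ diverges.

o Ratio test. Let $\sum_{n=1}^{\infty} a_n$ be a series with positive terms.

(i) Suppose that there exist $K < 1$ and $p \in \mathbb{N}$ such that $a_{n+1}/a_n \le K < 1$ for all
$n \ge p$, then $\sum_{n=1}^{\infty} a_n$ converges and

$$\sum_{n=p}^{\infty} a_n \le \frac{a_p}{1 - K}.$$

(ii) Suppose that there exist $K > 1$ and $p \in \mathbb{N}$ such that $a_{n+1}/a_n \ge K > 1$ for all
$n \ge p$, then $\sum_{n=1}^{\infty} a_n$ diverges.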

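Actually the root and the ratio tests essentially reduce to a comparison test with the
geometric series $\sum_{j=0}^{\infty} K^j$, for which we have

$$\sum_{j=0}^{\infty} K^j \quad \begin{cases} = +\infty & \text{if } K \ge 1, \\ = \dfrac{1}{1 - K} & \text{if } |K| < 1, \\ \text{is indeterminate} & \text{if } K \le -1. \end{cases}$$

Absolute convergence

We say that $\sum_{j=0}^{\infty} a_j$ converges absolutely if $\sum_{j=0}^{\infty} |a_j|$ converges. In case $\{a_n\}$ is a
sequence of reals, set

$$a_n^+ := \max(a_n, 0), \qquad a_n^- := -\min(a_n, 0).$$

o if $\sum_{j=0}^{\infty} a_j$, $a_j \in \mathbb{C}$, converges absolutely, then $\sum_{j=0}^{\infty} a_j$ converges and $\bigl| \sum_{j=0}^{\infty} a_j \bigr| \le \sum_{j=0}^{\infty} |a_j|$.

o if $\sum_{j=0}^{\infty} a_j$ is a series with real terms, then it converges absolutely if and only if
both $\sum_{j=0}^{\infty} a_j^+$ and $\sum_{j=0}^{\infty} a_j^-$ converge, since $0 \le a_n^+, a_n^- \le |a_n| = a_n^+ + a_n^-$ and
$a_n = a_n^+ - a_n^-$.

o the complex series $\sum_{j=0}^{\infty} a_j$ converges absolutely if and only if the four series with
nonnegative terms $\sum_{j=0}^{\infty} (\Re a_j)^+$, $\sum_{j=0}^{\infty} (\Re a_j)^-$, $\sum_{j=0}^{\infty} (\Im a_j)^+$ and $\sum_{j=0}^{\infty} (\Im a_j)^-$
converge.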



Series of products 

An alternating series is a series of the type $\sum_{j=0}^{\infty} (-1)^j a_j$ where $a_j \ge 0$ for all $j$.

o Leibniz test. Assume that $\{a_n\}$ is monotonically decreasing to zero. Then the
alternating series $\sum_{j=0}^{\infty} (-1)^j a_j$ converges. Moreover we have the following estimate
for the error between the sum of the series and the $n$-th partial sum:

$$\Bigl| \sum_{j=n+1}^{\infty} (-1)^j a_j \Bigr| \le a_{n+1}.$$

The assumption that $\{a_n\}$ is decreasing cannot be avoided.

Series with general terms which are the product of two quantities can be dealt with
by two more useful tests, Dirichlet's test, Theorem 6.55, and Abel's test, Theorem 6.57.
Both are applications of the formula of summation by parts, Proposition 6.51.



Product of series 

Let $a := \{a_n\}$ and $b := \{b_n\}$, $n \ge 0$, be two sequences. The sequence, denoted by
$\{(a * b)_n\}$,

$$(a * b)_n := \sum_{i+j=n} a_i b_j = \sum_{j=0}^{n} a_j b_{n-j}$$

is called the product of convolution, or Cauchy product, of $a$ and $b$.

o Cauchy's theorem. If $\sum_{j=0}^{\infty} a_j$ and $\sum_{j=0}^{\infty} b_j$ converge absolutely, then

$$\sum_{j=0}^{\infty} (a * b)_j$$

converges absolutely and

$$\sum_{j=0}^{\infty} (a * b)_j = \Bigl( \sum_{j=0}^{\infty} a_j \Bigr) \Bigl( \sum_{j=0}^{\infty} b_j \Bigr).$$

This extends the usual formula for the product of two polynomials to a couple of
absolutely convergent series.

Figure 6.21. The problem of the pile of coins.

6.9 Exercises 



6.68 Compute the sums of the telescoping series 

1 fv 1 

h + j^i0 + i)0 + 2)’ 

6.69 A ball falls from height h onto a rigid surface. It rebounds infinitely many 
times, each time reaching 75% of the height of the previous rebound. Compute the time 
needed in order that the ball be at rest. 

6.70 % von Koch’s curve. Starting from an equilateral triangle, erect an equilateral 
triangle on the middle third of its sides. Iterate the process on each of the sides of the 
polygonal figure obtained this way. The limit closed curve defined this way is called von 
Koch’s curve. Show that the resulting area is finite and compute it. Show that, instead, 
von Koch’s curve has infinite length. 

6.71 ^ Cantor set. A unit square is divided in 9 squares of side 1/3, the central one 
is colored black and the remaining 8 are divided each in 9 squares of side 1/9, and 
each of the central squares is coloured black. By induction we now iterate the process 
infinitely many times. Compute the area of the black region. The complement in the 
unit square of the black region is known as a Cantor set 

6.72 Estimate the error we make replacing $\sum_{j=1}^{\infty} 1/j^2$ with one of its partial sums.

6.73 A slow caterpillar is crawling on a rubber band at the speed of 1 cm per minute,
but a malevolent elf lengthens the band by 1 m per minute. Will the caterpillar ever be
able to reach the end of the rubber band?

6.74 ¶. Make a pile of $n$ coins of diameter 1. Dislodge them all in the same direction
as much as possible and keep them in equilibrium, as shown in Figure 6.21. What is
the horizontal distance $d_n$ between the centers of the first and the last coin?

6.75 Show the following

Proposition (Asymptotic root test). Let $\{a_n\}$ be a sequence of nonnegative numbers.
If

$$\limsup_{n \to \infty} \sqrt[n]{a_n} < 1,$$

then $\sum_{n} a_n$ converges. If

$$\limsup_{n \to \infty} \sqrt[n]{a_n} > 1,$$

then $\sum_{n} a_n$ diverges.

Analytical formulas for $\pi$

o VIETE. f = ■/I’ + + + ■ ■ ■ 

o WALLIS. f=n~l 
o Gregory, Leibniz, f = 
o Newton, f = (2n+i)2--in+i 

° 2^3 ~ 3^(2n+l) 

° f =E“=o(-1)"2;;^(5?^) “ (239)"+! 

o Area of the unit circle, f = \/l - dx 

o Half the length of the circle, tt = --7---- - dx 

The number e 

o We defined in [GM1] the Euler number $e$ as the unique real number such that
$\int_1^e \frac{1}{t} \, dt = 1$. We know, see e.g. [GM1], that

$$D(e^x) = e^x, \qquad \text{equivalently} \qquad \lim_{x \to 0} \frac{e^x - 1}{x} = 1,$$



hence. 




o e is irrational. 



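Viète and Euler formulas for $\sin x$

$$\sin x = x \prod_{j=1}^{\infty} \cos\Bigl(\frac{x}{2^j}\Bigr), \qquad \sin x = x \prod_{j=1}^{\infty} \Bigl(1 - \frac{x^2}{j^2\pi^2}\Bigr).$$

Figure 6.22. Analytical formulas for $\pi$, $e$, and the Viète and Euler formulas for $\sin x$.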




Figure 6.23. The first steps in the construction of the von Koch curve.

Figure 6.24. The first steps in the construction of the Cantor set in Exercise 6.71.



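6.76 Show the following

Proposition (Asymptotic ratio test). Let $\{a_n\}$ be a sequence of positive numbers.
If

$$\limsup_{n \to \infty} \frac{a_{n+1}}{a_n} < 1,$$

then $\sum_{n} a_n$ converges. If

$$\liminf_{n \to \infty} \frac{a_{n+1}}{a_n} > 1,$$

then $\sum_{n} a_n$ diverges.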

6.77 %. Show that 

(i) if n^(an+i — CLn) — >• 0, then {an} converges, 

(ii) if “n converges, then an/n converges, too. 

6.78 f. Show that ^ converges iff 2 : G C, \z\ < 1 and 2: ^ 1, and that 

converges if and only if 2: € C, \z\ < 1 and z ^ ±i. 

6.79 ¶ Pringsheim's theorem. Let $\{a_n\}$ be a decreasing sequence such that $\sum_{n} a_n$
converges. Show that $n a_n \to 0$ as $n \to \infty$.

6.80 ¶ Kronecker's lemma. If $\sum_{n=1}^{\infty} a_n/n$ converges, then $\frac{1}{N} \sum_{n=1}^{N} a_n \to 0$ as $N \to \infty$.

6.81 ¶. Suppose $a_n \to A$ and $b_n \to B$. Show that $\frac{1}{n} (a * b)_n \to AB$. [Hint: Write
$a_n := A + c_n$.]

6.82 f . Let an > 0 and A > 1. Show that 

00 

converges; estimate its sum. [Hint: Write the terms in function of Sn •= 5Zfe=i ®fc-] 

6.83 f. Let {an} and {6n} be two monotone and infinitesimal sequences. Show that 

an sinnt, t G M, and bn cosnt, t 0, converge. [Hint: Use Dirichlet’s test.] 

6.84 Is there a sequence {an} such that converges? 

6.85 f. Let a > 1. Find a lower bound for := Sn=i 






6.86 ^ Eisenstein series. Study the convergence of the complex series ^ — 



\ z+n 



i). E^=i + E^=o E^=0 where k>2. [Hint: Observe that, 

for r > 0, \z ± n\^ > {n — r)^ for fc > 1, n G N and |z| < r < n.] 

6.87 f. Study the convergence of some of the following series, possibly dependent on 
the real parameter x or on the complex parameter z: 



ne , 
n! 

(2n)! 

(3n)! — (2n)! ’ 
cos 7rn 

nlog(l + n) ’ 

arctan2“’^, 

log^(l + 1/n), 
n 

n arctan n + 1 ’ 



'n? log(l + 
y/ni 

2^n! 

(2n)!’ 

sinn! 



n(n -I- 1) ’ 

{k H" l){k + 2) 
log(l + 1/n), 

(-l)’^sin(l/n). 



6.88 Study the convergence of some of the following series, possibly dependent on 
the real parameter x or on the complex parameter z: 



nTT sin(l/n) — 27T 



(‘ - 



(-1)" 
n — log n ’ 


(logn)-'"<“, 


^ — arctan n^ , 


,cg(w<^), 






(l + xr'i+i/"), x€]-l,0[, 


(_2)ng-nx. 


n 


ZTI 


a z , 


n^’ 


nP' 






1 2 

log . , - 

n + 1 






1 \n^ 

'n? 

1 2 cos(l/n) 

n> 

7T 1 

arctan cos — , 

4 n 



/cos(2/n) \ 
\cos(l/n)/ 
^ - 1 
logn ’ 



1 \n+i 

X arctan n H — ) , 

n^ / 



gx/n _ 

nx ’ 

rl/n 



pL/n . sinx\ , 



rn^ 1 

/ sin^ — dx, 

Jn^ X 

pnl2 

/ (sincc)’^ dx, 

Jo
/•n+l o

(-1)" / t^e“* 

J n 

rn+l 

Vn / — 

•/ n ^ 







sin X dx, 



6.89 i[. For some of the previous series, estimate their sum or the order of magnitude 
of their partial sums. 



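6.90 ¶¶ Raabe's test. Let $\sum_{n} a_n$ be a series of positive terms. Show that, if there
exists $K > 1$ such that

$$n \Bigl( \frac{a_n}{a_{n+1}} - 1 \Bigr) \ge K \qquad \forall n,$$

then $\sum_{n} a_n$ converges, while, if

$$n \Bigl( \frac{a_n}{a_{n+1}} - 1 \Bigr) \le 1 \qquad \forall n,$$

then $\sum_{n} a_n$ diverges. [Hint: In the first case show that $a_{n+1} \le \frac{1}{K - 1} (n a_n - (n + 1) a_{n+1})$;
in the second show that $a_{n+1} \ge a_1/(n + 1)$.]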



6.91 ¶¶ Gauss's test. Let $\sum_{n} a_n$ be a series of positive terms. Suppose that

$$\frac{a_n}{a_{n+1}} = 1 + \frac{K}{n} + \frac{\theta_n}{n^{1+p}}$$

where $p > 0$ and $\{\theta_n\}$ is a bounded sequence. Then $\sum_{n} a_n$ converges if $K > 1$ and
diverges if $K \le 1$.



6.92 11. Discuss the convergence of the following series: 

OO I 

n! 



(“ + 1)(“ + 2) • • • (a + n) ’ 
g l-4- 7---(3n-2) 

n=l 



a positive integer. 



3 • 6 • 9 • • 3n 



^ /l-4-7---(3n-2)\2 
3-6-9--3n / 



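6.93 ¶. Let $u_n = v_n := (-1)^n/\sqrt{n + 1}$. The series $\sum_{n} u_n$ and $\sum_{n} v_n$ converge,
though not absolutely. Show that the product series $\sum_{n} w_n$, $w_n := \sum_{k=0}^{n} u_k v_{n-k}$,
does not converge. [Hint: Show that $|w_n| \ge \frac{2(n+1)}{n+2}$.]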



6.94 ¶. Let $\lambda_k > 0$ and $\sum_{k} \lambda_k < \infty$. Find a sequence of positive numbers $\{\alpha_k\}$,
$\alpha_k \to \infty$, such that $\sum_{k} \alpha_k \lambda_k < \infty$.

The manipulation of series reaches levels of subtlety that are hard to
imagine even nowadays in the works of Jacob Bernoulli (1654-1705), Johann
Bernoulli (1667-1748) and Leonhard Euler (1707-1783), and particularly
in the Ars conjectandi by Jacob Bernoulli and in Introductio in analysin
infinitorum and in Institutiones calculi by Euler. For instance in Ars
conjectandi Jacob Bernoulli introduced the so-called Bernoulli's numbers¹, see
Section 7.3 below, which may be defined by

$$\frac{x}{e^x - 1} = \sum_{n=0}^{\infty} B_n \frac{x^n}{n!}.$$



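Euler proved that

$$\sum_{n=1}^{\infty} \frac{1}{n^{2k}} = \frac{(2\pi)^{2k}}{2 (2k)!} |B_{2k}|$$

that in particular yields

$$\sum_{n=1}^{\infty} \frac{1}{n^2} = \frac{\pi^2}{6}, \qquad \sum_{n=1}^{\infty} \frac{1}{n^4} = \frac{\pi^4}{90}.$$

Euler also proved that

$$\sum_{n=1}^{\infty} \frac{(-1)^{n+1}}{n^{2k}} = \frac{\pi^{2k} (2^{2k-1} - 1)}{(2k)!} |B_{2k}|$$

that yields for example

$$1 - \frac{1}{2^2} + \frac{1}{3^2} - \frac{1}{4^2} + \cdots = \sum_{n=1}^{\infty} \frac{(-1)^{n+1}}{n^2} = \frac{\pi^2}{12},$$

and

$$\sum_{n=0}^{\infty} \frac{1}{(2n + 1)^{2k}} = \frac{\pi^{2k} (2^{2k} - 1)}{2 (2k)!} |B_{2k}|, \qquad 1 + \frac{1}{3^2} + \frac{1}{5^2} + \cdots = \sum_{n=0}^{\infty} \frac{1}{(2n + 1)^2} = \frac{\pi^2}{8}.$$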



Still Euler, introducing the so-called Euler's numbers²

$$\frac{1}{\cosh x} = \sum_{n=0}^{\infty} E_n \frac{x^n}{n!},$$

found that

$$\sum_{n=0}^{\infty} \frac{(-1)^n}{(2n + 1)^{2k+1}} = \frac{\pi^{2k+1}}{2^{2k+2} (2k)!} |E_{2k}|$$

from which

$$1 - \frac{1}{3^3} + \frac{1}{5^3} - \cdots = \sum_{n=0}^{\infty} \frac{(-1)^n}{(2n + 1)^3} = \frac{\pi^3}{32}.$$

¹ We have $B_0 = 1$, $B_1 = -1/2$, $B_2 = 1/6$, $B_{2n+1} = 0$ $\forall n \ge 1$.

² $E_{2n-1} = 0$ $\forall n \ge 1$ and $E_{2n} = 4^{2n+1} (B - 1/4)^{2n+1}/(2n + 1)$.

Figure 7.1. Leonhard Euler (1707-1783)
and the frontispiece of the Analysis by
Sir Isaac Newton (1643-1727).

Euler, as many others, also treated divergent series, finding their asymptotic
behaviour³.

In the eighteenth century, however, the use of series was quite formal: not
much attention was given to their convergence, though it was not totally
ignored. It is only at the beginning of the nineteenth century that series were
treated correctly, with Joseph Fourier (1768-1830), Carl Friedrich Gauss
(1777-1855), Bernhard Bolzano (1781-1848), Niels Henrik Abel (1802-1829).

A satisfactory definition of convergence appeared in the Théorie analytique
de la chaleur, but the first rigorous definition is due to Gauss in the



³ Divergent series play a fundamental role in the study of differential equations, as, for
instance, in the study of the vibrations of membranes with Wilhelm Bessel (1784-1846),
Enrico Betti (1823-1892), Thomas Jan Stieltjes (1856-1894) and Ernesto
Cesàro (1859-1906) among others, and, starting from J. Henri Poincaré (1854-1912),
in the theory of perturbations of integrable differential systems.

Figure 7.2. The frontispiece of Ars conjectandi by Jacob Bernoulli (1654-1705) and the
first page of the Disquisitiones by Carl Friedrich Gauss (1777-1855).



paper Disquisitiones generales in 1812, in which Gauss studies the
hypergeometric series already discussed by Euler. Finally, it was with Bernhard
Bolzano (1781-1848) and, especially, with Augustin-Louis Cauchy (1789-1857)
in his Cours d'Analyse that the theory of series found its firm basis.
With Karl Weierstrass (1815-1897) the theory of complex power series
becomes identified with the theory of complex analytic functions, a theory
which is extremely relevant in physics and engineering, as well as in algebraic
geometry and analytic number theory. In connection with number theory
we should at least mention Dirichlet series

$$\sum_{n=1}^\infty \frac{a_n}{n^z},$$

the simplest of which is

$$\zeta(z) := \sum_{n=1}^\infty \frac{1}{n^z},$$

that defines for $\Re(z) > 1$ Riemann's zeta function, of tremendous relevance
in studying the distribution of prime numbers as

$$\zeta(z) = \prod_{p\ \mathrm{prime}} \frac{1}{1 - p^{-z}},$$

as well as in studying the distribution of the eigenvalues of differential
operators.

Of course our goal in this chapter is a great deal more modest. We shall
develop the basic theory in the first section, where we see how power series
preserve much of the rigidity of the polynomials, and we shall discuss some
applications in the last two sections, trying to give a flavour of their use
in the eighteenth century in Section 7.4.



7.1 Basic Theory 

Given a sequence $\{a_n\}$ of real or complex numbers, the power series
centered at 0 with coefficients $\{a_n\}$ is

$$\sum_{n=0}^\infty a_n z^n := a_0 + \sum_{n=1}^\infty a_n z^n, \qquad z \in \mathbb{C},$$

that is, the sequence $\{s_n(z)\}$ of the functions

$$s_n(z) = \sum_{k=0}^n a_k z^k := a_0 + \sum_{k=1}^n a_k z^k, \qquad z \in \mathbb{C}.$$

We might as well consider the power series centered at $z_0$ with coefficients
$\{a_n\}$,

$$\sum_{n=0}^\infty a_n (z - z_0)^n,$$

but of course the two series are related by a simple change of variables.
Also, if we restrict ourselves to the real axis, we may as well consider
the real power series $\sum_{n=0}^\infty a_n x^n$, $x \in \mathbb{R}$.

Clearly the series $\sum_{n=0}^\infty a_n z^n$ converges or diverges depending
on the choice of $z$. It obviously converges at zero with sum $a_0$. What
is special about power series is that to each of them is associated a disc of
convergence such that the series converges if $z$ is in the interior of the disc
(provided the radius of that disc is positive) and diverges if $z$ is in the
exterior of the disc.
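
The dichotomy just described can be observed numerically. The following is a minimal sketch, assuming the geometric series ($a_n = 1$) as a test case; the helper name `partial_sum` is ours and not part of the text.

```python
def partial_sum(coeffs, z):
    """Return s_n(z) = sum_{k=0}^{n} a_k z^k for coeffs = [a_0, ..., a_n] (Horner's rule)."""
    total = 0
    for a in reversed(coeffs):
        total = total * z + a
    return total

# Assumed test case: the geometric series, a_n = 1 for all n.
coeffs = [1] * 60
inside = partial_sum(coeffs, 0.5)    # |z| < 1: close to 1/(1 - z) = 2
outside = partial_sum(coeffs, 1.5)   # |z| > 1: the partial sums blow up
print(abs(inside - 2.0), abs(outside))
```

Inside the disc the partial sum is already indistinguishable from the limit, while outside it grows without bound as more terms are added.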



7.1.1 Circle of convergence 

7.1 Definition. Let $\sum_{n=0}^\infty a_n z^n$ be a power series. The number
$\rho \in \overline{\mathbb{R}}_+$ defined by

$$\frac{1}{\rho} := \limsup_{n \to \infty} \sqrt[n]{|a_n|}$$

is called the radius of convergence of $\sum_{n=0}^\infty a_n z^n$. We use the
conventions $1/0^+ = +\infty$ and $1/{+\infty} = 0$.

7.2 ¶. A series $\sum_{n=0}^\infty a_n z^n$ has a positive radius of convergence if and only if $\{|a_n|\}$
grows at most exponentially, that is, if and only if there exists $M > 0$ such that
$|a_n| \le M^n$ $\forall n \ge 1$.

We have the following. 



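7.3 Theorem. Let $\sum_{n=0}^\infty a_n z^n$ be a power series with radius of
convergence $\rho \ge 0$. Then

(i) if $\rho > 0$, then $\sum_{n=0}^\infty a_n z^n$ converges absolutely for any $z$ such that
$|z| < \rho$; actually, for any $0 < r < \rho$ the series $\sum_{n=0}^\infty |a_n| r^n$ converges,

(ii) $\sum_{n=0}^\infty a_n z^n$ does not converge if $|z| > \rho$.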






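Proof. The proof is essentially a repetition of the proof of the root test.

(i) Let $t$ be such that $0 < r < t < \rho$. Since $\limsup_{n\to\infty} \sqrt[n]{|a_n|} = 1/\rho < 1/t$,
there exists $\bar n$ such that $\sqrt[n]{|a_n|} \le 1/t$ for all $n \ge \bar n$, hence
$|a_n| \le 1/t^n$, from which

$$|a_n| r^n \le \Big(\frac{r}{t}\Big)^n \qquad \text{for all } n \ge \bar n.$$

A comparison with the geometric series yields the second claim and the
estimate

$$\sum_{n=p}^\infty |a_n| r^n \le \sum_{n=p}^\infty \Big(\frac{r}{t}\Big)^n \qquad \forall p \ge \bar n, \qquad (7.1)$$

and therefore (i).

(ii) If $|z| > \rho$, then $\limsup_{n\to\infty} \sqrt[n]{|a_n|\,|z|^n} = |z|/\rho > 1$. From the
characteristic property of limsup, if $h$ is a number between 1 and $|z|/\rho$,
we can find a subsequence $\{a_{k_n}\}$ of $\{a_n\}$ such that
$\sqrt[k_n]{|a_{k_n}|\,|z|^{k_n}} > h$ $\forall n$, i.e.,

$$|a_{k_n}|\,|z|^{k_n} > h^{k_n} \qquad \forall n.$$

It follows that $|a_{k_n}|\,|z^{k_n}| \to \infty$. In particular, $a_n z^n$ does not converge to
zero; consequently, by Proposition 6.7, the series does not converge at $z$. □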



7.4 Remark. Theorem 7.3 implies the following characterizations for the
radius of convergence $\rho$ of $\sum_{n=0}^\infty a_n z^n$:

$$\rho = \sup\Big\{ |z| \;\Big|\; \sum_{n=0}^\infty a_n z^n \text{ converges absolutely} \Big\}
      = \sup\Big\{ |z| \;\Big|\; \sum_{n=0}^\infty a_n z^n \text{ converges} \Big\}.$$

a. The disc and the domain of convergence

Theorem 7.3 implies that the domain of convergence of a power series,
that is, the set $A$ of points at which it converges, is its disc of convergence
$\{z \mid |z| < \rho\}$ (the open interval $]-\rho, \rho[$ for real series) union possibly one
or more points on the circle $|z| = \rho$ (one or both of $-\rho$, $\rho$ for real series):

$$\{z \mid |z| < \rho\} \subset A \subset \{z \mid |z| \le \rho\}.$$

Let us see a few examples. 

7.5 Example. We already saw several times that the geometric series $\sum_{n=0}^\infty x^n$
converges if and only if $|x| < 1$ and that

$$\sum_{n=0}^\infty x^n = \frac{1}{1 - x}, \qquad |x| < 1.$$

The domain of convergence of the geometric series is then the open interval $]-1, 1[$.

Similarly we proved in Example 6.40 that the complex geometric series $\sum_{n=0}^\infty z^n$
converges if and only if $|z| < 1$. This time the domain of convergence is the interior of
the disc of convergence, $\{z \mid |z| < 1\}$.

7.6 Example. The power series $\sum_{n=1}^\infty x^n/n^2$ converges absolutely, hence converges,
for all $x$ such that $|x| \le 1$, since in this case $|x|^n/n^2 \le 1/n^2$ for all $n$ and $\sum_{n=1}^\infty 1/n^2$
converges. The series $\sum_{n=1}^\infty x^n/n^2$ does not converge if $|x| > 1$ by Proposition 6.7 since in this
case $|x|^n/n^2 \to \infty$. Thus $\sum_{n=1}^\infty x^n/n^2$ converges (actually converges absolutely) if and
only if $|x| \le 1$. This time the domain of convergence is the closed interval $[-1, 1]$.

With exactly the same computations, one can show that the complex series $\sum_{n=1}^\infty z^n/n^2$
converges absolutely if and only if $|z| \le 1$ and does not converge if $|z| > 1$.
Thus the domain of convergence is the disc of convergence union its boundary.

7.7 Example. We saw in Example 6.13 that $\sum_{n=0}^\infty (-1)^n \frac{x^{n+1}}{n+1}$ converges if and only
if $-1 < x \le 1$ with sum $\log(1 + x)$. Thus $\sum_{n=0}^\infty (-1)^n \frac{x^{n+1}}{n+1}$ has radius of
convergence $\rho = 1$, and its domain of convergence is the half-open interval $]-1, 1]$.

The complex series $\sum_{n=0}^\infty (-1)^n \frac{z^{n+1}}{n+1}$ has the same radius of convergence $\rho = 1$.
Thus it converges absolutely if $|z| < 1$ and does not converge if $|z| > 1$. It can be
proved as a trivial application of the Dirichlet test, Theorem 7.30, that $\sum_{n=0}^\infty (-1)^n \frac{z^{n+1}}{n+1}$
converges at any $z$ such that $|z| = 1$ except $z = -1$. Therefore this time the domain of
convergence is $\{z \mid |z| \le 1,\ z \ne -1\}$.



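7.8 Example. The power series $\sum_{n=1}^\infty \frac{2^n}{n^2} z^{2n}$ has coefficients

$$a_n = \begin{cases} 0 & \text{if } n \text{ is odd}, \\ 2^p/p^2 & \text{if } n = 2p, \end{cases}$$

and radius of convergence $1/\sqrt{2}$ since

$$\frac{1}{\rho} = \limsup_{n\to\infty} \sqrt[n]{|a_n|} = \lim_{p\to\infty} \sqrt[2p]{\frac{2^p}{p^2}} = \sqrt{2}.$$

We may also regard $\sum_{n=1}^\infty \frac{2^n}{n^2} z^{2n}$ as a power series in $2z^2$. In other words, by
changing variable, $w := 2z^2$, we get the power series $\sum_{n=1}^\infty w^n/n^2$ in Example 7.6,
that converges absolutely for all $w$ such that $|w| = 2|z|^2 \le 1$, concluding that
$\sum_{n=1}^\infty \frac{2^n}{n^2} z^{2n}$ converges absolutely for $|z| \le 1/\sqrt{2}$ and does not converge for
$|z| > 1/\sqrt{2}$.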
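
The root formula for the radius of convergence can be approximated numerically. The sketch below assumes the coefficients $a_{2p} = 2^p/p^2$, $a_n = 0$ for odd $n$, and replaces the limsup by the largest $n$-th root over a finite window of indices; both the window and the helper names are illustrative choices, and working with logarithms is only a device to avoid overflow.

```python
import math

def log_abs_coeff(n):
    """log|a_n| for a_{2p} = 2^p / p^2 (p >= 1), or None when a_n = 0."""
    if n == 0 or n % 2 == 1:
        return None
    p = n // 2
    return p * math.log(2.0) - 2.0 * math.log(p)

def radius_estimate(log_abs, start, stop):
    """Approximate 1 / limsup |a_n|^(1/n) by the largest n-th root with start <= n < stop."""
    roots = [math.exp(log_abs(n) / n) for n in range(start, stop) if log_abs(n) is not None]
    return 1.0 / max(roots)

rho = radius_estimate(log_abs_coeff, 2000, 4000)
print(rho, 1.0 / math.sqrt(2.0))
```

The estimate approaches $1/\sqrt{2}$ slowly, because the factor $p^{-1/p}$ tends to 1 only at a logarithmic rate.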
7.1.2 Continuity of the sum

Let $\sum_{n=0}^\infty a_n z^n$ be a power series with radius of convergence $\rho > 0$. For
any $n$, $S_n(z) := \sum_{k=0}^n a_k z^k$ is a polynomial; it is therefore natural to ask
how many properties of the functions $S_n(z)$ are preserved when passing
to the limit. In particular, is the sum of the series $S(z)$ continuous in its
domain of definition? It is hard to resist writing

$$\lim_{z\to z_0} S(z) = \lim_{z\to z_0} \lim_{n\to\infty} S_n(z)
  = \lim_{n\to\infty} \lim_{z\to z_0} S_n(z) = \lim_{n\to\infty} S_n(z_0) = S(z_0). \qquad (7.2)$$

However, it is a fact that the change in the order is not allowed, i.e., the
equality

$$\lim_{z\to z_0} \lim_{n\to\infty} S_n(z) = \lim_{n\to\infty} \lim_{z\to z_0} S_n(z)$$

is in general false. For instance, if $S_n(x) = x^n$, $x \in [0, 1]$, then

$$\lim_{x\to 1^-} \lim_{n\to\infty} x^n = \lim_{x\to 1^-} 0 = 0 \quad\text{and}\quad
  \lim_{n\to\infty} \lim_{x\to 1^-} x^n = \lim_{n\to\infty} 1 = 1. \qquad (7.3)$$

The computation in (7.2) is therefore unjustified.
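
The failure of the exchange of limits for $x^n$ can be seen with a few floating-point evaluations. This is only an illustration: a very large exponent stands in for $n \to \infty$, and a point very close to 1 stands in for $x \to 1^-$.

```python
def s(n, x):
    """S_n(x) = x^n."""
    return x ** n

# n -> infinity first (x < 1 fixed), then x -> 1-: the values stay near 0.
inner_n_first = [s(10**6, x) for x in (0.9, 0.99, 0.999)]
# x -> 1- first (n fixed), then n -> infinity: the values stay near 1.
inner_x_first = [s(n, 1 - 1e-12) for n in (10, 100, 1000)]
print(inner_n_first, inner_x_first)
```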



a. Uniform Convergence 

The exchange of limits in (7.2) turns out to be correct in case we have a
uniform estimate (in $z$) for the error when replacing $S(z)$ by $S_n(z)$. Because
of the relevance of this notion it is worth giving the following

7.9 Definition. Let $\{S_n\}$ be a sequence of functions $S_n : A \to \mathbb{C}$ defined
on a subset $A$ of $\mathbb{R}^2$, and let $S : A \to \mathbb{C}$. We say that $\{S_n\}$ converges
uniformly to $S$ in $A$ if

$$\forall \epsilon > 0 \ \exists\, \bar n \text{ such that } |S_n(z) - S(z)| < \epsilon \quad \forall n \ge \bar n \text{ and } \forall z \in A.$$

We say that a series of functions $\sum_{n=0}^\infty f_n(z)$ converges uniformly in $A$
to $f : A \to \mathbb{C}$, if the sequence of its partial sums converges uniformly in $A$.

7.10 Remark. In comparison with pointwise convergence in $A$, that
is, $S_n(z) \to S(z)$ $\forall z \in A$, uniform convergence requires that the
index $\bar n$, for which the error $|S_n(z) - S(z)|$ is smaller than $\epsilon$ for all $n \ge \bar n$,
does not depend on the particular point $z \in A$.

Notice that the definition does not allow us to produce a uniform limit,
but it only allows us to verify whether a function $S : A \to \mathbb{C}$ is or is not the
uniform limit of the sequence $\{S_n\}$. According to Definition 7.9, computing
uniform limits is a two-step procedure:

o first, guess a possible limit function $S : A \to \mathbb{C}$,
o second, prove that $S$ is in fact the uniform limit of $\{S_n\}$ in $A$.







By considering the maximal error between $S_n(z)$ and $S(z)$ when $z$
varies in $A$,

$$\|S_n - S\|_{\infty,A} := \sup_{z \in A} |S_n(z) - S(z)|,$$

we can say, by comparing the explicit definitions, that $S_n \to S$ uniformly
if and only if the numerical sequence $\{\|S_n - S\|_{\infty,A}\} \to 0$ as $n \to \infty$. Also,
from

$$|S_n(z) - S(z)| \le \|S_n - S\|_{\infty,A} \qquad \forall z \in A,$$

uniform convergence of $\{S_n\}$ to $S$ in $A$ implies pointwise convergence at
each point $z \in A$. As a consequence the pointwise limit is the only possible
candidate to be the uniform limit.

The converse of the last claim is false in general, since there exist
sequences of functions that converge pointwise but not uniformly, as the
following example shows.

7.11 Example. Consider a function $\varphi : \mathbb{R} \to \mathbb{R}$ that is bounded with $\|\varphi\|_{\infty,\mathbb{R}} = M > 0$
and such that $\varphi(x) \to 0$ as $x \to -\infty$. Define $S_n(x) := \varphi(x - n)$, $x \in \mathbb{R}$. We see that
for any fixed $x$, $S_n(x) \to 0$, hence $\{S_n\}$ converges pointwise to 0, but $\{S_n\}$ does not
converge uniformly to zero since $\|S_n\|_{\infty,\mathbb{R}} = \|\varphi\|_{\infty,\mathbb{R}} = M > 0$.

7.12 ¶. Show that the sequence $\{\varphi_n\}$ of functions $\varphi_n : [0, 1] \to \mathbb{R}$ given by $\varphi_n(x) :=$

converges to zero in $[0, 1]$ but not uniformly.
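
The travelling-profile construction can be checked on a grid. The sketch assumes the specific profile $\varphi(x) = 1/(1+x^2)$, which is bounded with supremum 1 and tends to 0 as $x \to -\infty$; the sampling grid is likewise an illustrative choice, so the computed maximum is only a stand-in for the supremum over $\mathbb{R}$.

```python
def phi(x):
    """A bounded profile with sup = 1 (attained at x = 0) and phi(x) -> 0 as x -> -infinity."""
    return 1.0 / (1.0 + x * x)

def s(n, x):
    """S_n(x) = phi(x - n): the same profile shifted to the right by n."""
    return phi(x - n)

grid = [k / 10.0 for k in range(-1000, 3001)]    # sample points in [-100, 300]
for n in (1, 10, 100):
    sup_norm = max(abs(s(n, x)) for x in grid)   # stays equal to 1
    at_zero = s(n, 0.0)                          # tends to 0 as n grows
    print(n, sup_norm, at_zero)
```

At every fixed point the values decay, yet the maximal error never decreases, which is exactly the gap between pointwise and uniform convergence.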



b. Continuity of uniform limits 

7.13 Theorem. Let $\{S_n\}$ be a sequence of continuous functions $S_n : A \to \mathbb{C}$
on $A$ and suppose that $\{S_n\}$ converges uniformly in $A$ to $S : A \to \mathbb{C}$.
Then $S$ is continuous on $A$.

Proof. Let $z_0 \in A$ and $\epsilon > 0$. Since $S_n(z) \to S(z)$ uniformly, there exists
$n$ such that $|S_n(z) - S(z)| < \epsilon$ for all $z \in A$. Consequently

$$|S(z) - S(z_0)| = |S_n(z) - S_n(z_0) + S(z) - S_n(z) + S_n(z_0) - S(z_0)|$$
$$\le |S_n(z) - S_n(z_0)| + |S(z) - S_n(z)| + |S(z_0) - S_n(z_0)|$$
$$\le |S_n(z) - S_n(z_0)| + 2\epsilon.$$

Since $S_n$ is continuous, we also find $\delta > 0$ such that $|S_n(z) - S_n(z_0)| < \epsilon$
whenever $z \in A$ and $|z - z_0| < \delta$. In conclusion we infer that $|S(z) - S(z_0)| < 3\epsilon$
for all $z \in A$ with $|z - z_0| < \delta$. □

Notice that the assumption of uniform convergence in Theorem 7.13
cannot be dropped. For instance, the sequence $\{x^n\}$, $x \in [0, 1]$, converges
pointwise to the discontinuous function

$$S(x) = \begin{cases} 0 & \text{if } 0 \le x < 1, \\ 1 & \text{if } x = 1. \end{cases}$$
c. Uniform convergence of power series

Going back to power series, we have the following.

7.14 Theorem. Let $\sum_{n=0}^\infty a_n z^n$ be a power series that converges
absolutely at $z_0$. Then $\sum_{n=0}^\infty a_n z^n$ converges uniformly in $\{z \mid |z| \le |z_0|\}$. In
particular, if $\rho$ is the radius of convergence of $\sum_{n=0}^\infty a_n z^n$ and $\rho > 0$, then
$\sum_{n=0}^\infty a_n z^n$ converges uniformly in $\{z \mid |z| \le r\}$ for all $r < \rho$.

Proof. For $|z| \le |z_0|$ and $n \ge 1$ we have by the triangle inequality

$$|S(z) - S_n(z)| = \Big| \sum_{j=n+1}^\infty a_j z^j \Big| \le \sum_{j=n+1}^\infty |a_j|\,|z|^j \le \sum_{j=n+1}^\infty |a_j|\,|z_0|^j, \qquad (7.4)$$

hence

$$\sup_{|z| \le |z_0|} |S(z) - S_n(z)| \le \sum_{j=n+1}^\infty |a_j|\,|z_0|^j \to 0, \qquad n \to \infty.$$

The second part of the claim follows from Theorem 7.3. □
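
The tail estimate (7.4) can be tested on the geometric series, for which both the sum $1/(1-z)$ and the tail $\sum_{j>n} r^j = r^{n+1}/(1-r)$ are explicit. The following sketch samples finitely many points on the circle $|z| = r$ (an illustrative discretization, not a proof), and its helper names are ours.

```python
import cmath

def geometric_partial(n, z):
    """S_n(z) = sum_{j=0}^{n} z^j."""
    return sum(z ** j for j in range(n + 1))

def sup_error_on_circle(n, r, samples=360):
    """Largest |S(z) - S_n(z)| over sample points with |z| = r, where S(z) = 1/(1 - z)."""
    points = [cmath.rect(r, 2 * cmath.pi * k / samples) for k in range(samples)]
    return max(abs(1 / (1 - z) - geometric_partial(n, z)) for z in points)

r, n = 0.5, 20
tail = r ** (n + 1) / (1 - r)     # sum_{j>n} |a_j| r^j with a_j = 1
print(sup_error_on_circle(n, r), tail)
```

The sampled maximum is attained at $z = r$, where the error equals the tail bound, so the estimate cannot be improved for this series.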



Theorems 7.14 and 7.13 then yield at once 



7.15 Corollary. Let $S(z) = \sum_{n=0}^\infty a_n z^n$ be the sum of a power series
with a positive radius of convergence $\rho > 0$. Then $S(z)$ is continuous on
$\{z \mid |z| < \rho\}$.

Proof. Let $z_0$ be inside the disc of convergence, $|z_0| < \rho$, and let $s$ be such
that $|z_0| < s < \rho$. By Theorems 7.14 and 7.13 the restriction of $S(z)$ to
$|z| \le s$ is continuous. Consequently $S$ is continuous at $z_0$, since $|z_0| < s$. □



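7.16 ¶. Show directly, that is, by using the definition of continuity, that the sum of a
power series is continuous on $\{z \mid |z| < \rho\}$. [Hint: Let $z_0$ be such that $|z_0| < \rho$. Observe
that for $\sigma < \rho - |z_0|$ and $|z - z_0| < \sigma$ one has

$$\Big| \sum_{n=0}^\infty a_n z^n - \sum_{n=0}^\infty a_n z_0^n \Big| = \Big| \sum_{n=1}^\infty a_n (z^n - z_0^n) \Big|$$
$$\le \Big| \sum_{n=1}^\infty a_n (z - z_0)\big(z^{n-1} + z^{n-2} z_0 + \cdots + z z_0^{n-2} + z_0^{n-1}\big) \Big|$$
$$\le \sum_{n=1}^\infty |a_n|\,|z - z_0|\, n\,(|z_0| + \sigma)^{n-1} = |z - z_0| \sum_{n=1}^\infty n |a_n| (|z_0| + \sigma)^{n-1}.]$$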
7.1.3 Differentiation and integration

In this section we deal with the series of derivatives and integrals given
respectively by

$$\sum_{n=1}^\infty n a_n z^{n-1} \qquad\text{and}\qquad \sum_{n=0}^\infty a_n \frac{z^{n+1}}{n+1}$$

and show that in the interior of the disc of convergence the derivative and
the integral of the sum are the sums of the series of derivatives and of
integrals,

$$D\Big( \sum_{n=0}^\infty a_n z^n \Big) = \sum_{n=1}^\infty n a_n z^{n-1}, \qquad
  \int_0^z \Big( \sum_{n=0}^\infty a_n w^n \Big)\, dw = \sum_{n=0}^\infty a_n \frac{z^{n+1}}{n+1}.$$

In doing that we deal first with real power series and then with complex
power series, since in the latter case one needs to introduce suitable
notions of differentiation and integration for functions of complex
variables. We postpone the discussion of some partial results at the boundary
to the next section.



a. Series of derivatives and of integrals 

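7.17 Proposition. The power series $\sum_{n=0}^\infty a_n z^n$, $\sum_{n=1}^\infty n a_n z^{n-1}$ and
$\sum_{n=0}^\infty a_n \frac{z^{n+1}}{n+1}$ have the same radius of convergence.

Proof. Let $\rho$ and $\sigma$ be respectively the radii of convergence of $\sum_{n=0}^\infty a_n z^n$
and of $\sum_{n=1}^\infty n a_n z^{n-1}$. Let us prove that $\rho = \sigma$. We first prove that $\sigma \le \rho$.
Assuming $\sigma > 0$ since otherwise the claim is trivial, fix $z$ such that $|z| < \sigma$.
Since

$$|a_n|\,|z|^n \le |z|\, n\, |a_n|\,|z|^{n-1}, \qquad n \ge 1,$$

we deduce that $\sum_{n=0}^\infty a_n z^n$ converges absolutely at $z$ by the comparison
test; thus $\rho \ge \sigma$.

Let us prove that $\rho \le \sigma$. Assuming $\rho > 0$, for any fixed $|z| < \rho$, let $r$
be such that $|z| < r < \rho$. We infer that $\sum_{n=0}^\infty |a_n| r^n$ converges, so that
$\{|a_n| r^n\}$ is bounded, $|a_n| r^n \le M$, hence

$$n |a_n|\,|z|^{n-1} \le \frac{M}{r}\, n \Big( \frac{|z|}{r} \Big)^{n-1}.$$

Therefore again the comparison test implies the absolute convergence of
$\sum_{n=1}^\infty n a_n z^{n-1}$ at $z$; $z$ being arbitrary, we then conclude that $\sigma \ge \rho$.

Finally, $\sum_{n=0}^\infty a_n \frac{z^{n+1}}{n+1}$ has radius of convergence $\rho$, too, since $\sum_{n=0}^\infty a_n z^n$
is the series of derivatives of $\sum_{n=0}^\infty a_n \frac{z^{n+1}}{n+1}$. □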
7.18 ¶. Show that

$$\limsup_{n\to\infty} \sqrt[n]{(n+1)|a_{n+1}|} = \limsup_{n\to\infty} \sqrt[n]{\frac{|a_{n-1}|}{n}} = \limsup_{n\to\infty} \sqrt[n]{|a_n|},$$

inferring this way Proposition 7.17.

Iterating Proposition 7.17 we also conclude that, for every $k$, the series
of the $k$-th derivatives

$$\sum_{n=k}^\infty n(n-1)(n-2)\cdots(n-k+1)\, a_n z^{n-k}$$

has the same radius of convergence as $\sum_{n=0}^\infty a_n z^n$.



b. Real power series 

Consider a real power series $\sum_{n=0}^\infty a_n x^n$ in which $\{a_n\} \subset \mathbb{R}$ and $x \in \mathbb{R}$.
Notice that its domain of convergence is an interval of radius $\rho$, $\rho$ being
the radius of convergence of the series, that can be either open, closed,
left-closed or right-closed.

First we state the following simple theorem concerning the exchange of
limit and integrals.



7.19 Theorem. Let $\{S_n\}$ be a sequence of functions $S_n : [a, b] \to \mathbb{R}$ that
converges uniformly to $S : [a, b] \to \mathbb{R}$ on the bounded interval $[a, b]$. Then

$$\int_a^b S_n(x)\, dx \to \int_a^b S(x)\, dx.$$

Proof. This follows at once, observing that $S(x)$ being continuous by Theorem 7.13, we
have

$$\Big| \int_a^b \big( S_n(x) - S(x) \big)\, dx \Big| \le (b - a) \sup_{x \in [a,b]} |S_n(x) - S(x)| = (b - a)\, \|S_n - S\|_{\infty,[a,b]}. \qquad (7.5)$$

□
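
Estimate (7.5) can be checked on a case where everything is explicit. The sketch below assumes $S_n(x) = \sum_{k=0}^n x^k$ on $[0, b]$ with $b = 1/2$, so that $S(x) = 1/(1-x)$, $\int_0^b S = -\log(1-b)$ and $\|S_n - S\|_{\infty,[0,b]} = b^{n+1}/(1-b)$; the function name is ours.

```python
import math

def integral_of_partial_sum(n, b):
    """Exact integral over [0, b] of S_n(x) = sum_{k=0}^{n} x^k."""
    return sum(b ** (k + 1) / (k + 1) for k in range(n + 1))

b = 0.5
exact = -math.log(1 - b)                       # integral of S(x) = 1/(1 - x) over [0, b]
for n in (5, 10, 20):
    gap = abs(exact - integral_of_partial_sum(n, b))
    bound = b * (b ** (n + 1) / (1 - b))       # (b - a) * sup |S_n - S| on [0, b]
    print(n, gap, bound)
```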



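7.20 Remark. Notice that both the assumptions of uniform convergence
and of performing the integrals on a bounded interval cannot be
dropped in Theorem 7.19. For instance, starting with $\varphi(x) = x e^{-x}$,

o Choosing $S_n(x) := \frac{1}{n} \varphi\big(\frac{x}{n}\big)$, $x \ge 0$, we have $\|S_n\|_{\infty,[0,\infty[} = \frac{1}{n}\|\varphi\|_{\infty,[0,\infty[} \to 0$,
hence $S_n$ converges uniformly to $S(x) = 0$, but

$$\int_0^\infty S_n(x)\, dx = \int_0^\infty \varphi(x)\, dx > 0 \qquad \text{for all } n \ge 1,$$

although for any $a > 0$,

$$\int_0^a S_n(x)\, dx = \int_0^{a/n} \varphi(x)\, dx \to 0 \qquad \text{as } n \to \infty.$$

o Choosing $S_n(x) := n \varphi(nx)$, $x \in [0, 1]$, $S_n(x)$ converges pointwise to 0
$\forall x \in [0, 1]$, and

$$\int_0^1 S_n(x)\, dx = \int_0^n \varphi(t)\, dt \to \int_0^\infty \varphi(t)\, dt > 0.$$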



7.21 Theorem (Differentiation and integration of power series).
Let $S(x)$ be the sum of the power series $\sum_{n=0}^\infty a_n x^n$. If $\sum_{n=0}^\infty a_n x^n$
converges uniformly to $S$ on a closed interval $[a, b] \subset \mathbb{R}$, then

$$\int_a^b S(x)\, dx = \sum_{n=0}^\infty a_n \int_a^b x^n\, dx.$$

Consequently,

(i) assuming that the radius of convergence $\rho$ of $\sum_{n=0}^\infty a_n x^n$ is positive,
we have

$$\int_0^x \sum_{n=0}^\infty a_n t^n\, dt = \sum_{n=0}^\infty a_n \frac{x^{n+1}}{n+1} \qquad (7.6)$$

for all $x$, $|x| < \rho$,

(ii) $S \in C^\infty(]-\rho, \rho[)$ and

$$D^k S(x) = \sum_{n=k}^\infty n(n-1)\cdots(n-k+1)\, a_n x^{n-k}, \qquad |x| < \rho. \qquad (7.7)$$

Proof. The first claim and (i) follow at once from Theorem 7.19 since $S_n(x) := \sum_{j=0}^n a_j x^j$
converges uniformly to $S$.

(ii) From (i) and Proposition 7.17 we get

$$\int_0^x \Big( \sum_{n=1}^\infty n a_n t^{n-1} \Big)\, dt = \sum_{n=1}^\infty \Big( n a_n \int_0^x t^{n-1}\, dt \Big) = \sum_{n=1}^\infty a_n x^n = S(x) - a_0 = S(x) - S(0).$$

By the fundamental theorem of calculus, $S$ is then differentiable on $\{|x| < \rho\}$ and

$$S'(x) = \sum_{n=1}^\infty n a_n x^{n-1}, \qquad |x| < \rho.$$

The proof is then easily completed by induction. □
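
Term-by-term differentiation can be illustrated on the geometric series, whose sum $1/(1-x)$ has derivative $1/(1-x)^2$. The sketch below compares a partial sum of $\sum_{n \ge 1} n x^{n-1}$ with the closed form and with a central difference quotient; the number of terms and the step `h` are arbitrary illustrative choices.

```python
def derivative_series(x, terms=200):
    """Partial sum of sum_{n>=1} n x^(n-1), the term-by-term derivative of sum x^n."""
    return sum(n * x ** (n - 1) for n in range(1, terms + 1))

def central_difference(f, x, h=1e-5):
    """Symmetric difference quotient approximating f'(x)."""
    return (f(x + h) - f(x - h)) / (2 * h)

x = 0.3
closed_form = 1.0 / (1.0 - x) ** 2               # derivative of 1/(1 - x)
numeric = central_difference(lambda t: 1.0 / (1.0 - t), x)
print(derivative_series(x), closed_form, numeric)
```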

7.22 ¶. Show the following

Proposition. Let $\{S_n\}$ be a sequence of functions $S_n : [a, b] \to \mathbb{R}$ of class $C^1([a, b])$.
Suppose that the derivatives $S_n' : [a, b] \to \mathbb{R}$ converge uniformly to a function $T : [a, b] \to \mathbb{R}$
and that $S_n(x_0) \to \lambda \in \mathbb{R}$ as $n \to \infty$ for some $x_0 \in [a, b]$. Then

(i) $\{S_n\}$ converges pointwise to a function $S : [a, b] \to \mathbb{R}$,

(ii) $S$ is differentiable on $[a, b]$, $S'(x) = T(x)$ $\forall x \in [a, b]$ and $S$ is of class $C^1([a, b])$.

[Hint: Compare (ii) of Theorem 7.21.]



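7.23 Remark. Theorem 7.21 extends immediately to series $\sum_{n=0}^\infty a_n x^n$
with $a_n \in \mathbb{C}$ and $x \in \mathbb{R}$. Assuming that $\sum_{n=0}^\infty a_n x^n$ has a positive radius
of convergence $\rho > 0$, we have

$$\sum_{n=0}^\infty a_n x^n = \sum_{n=0}^\infty \Re(a_n) x^n + i \sum_{n=0}^\infty \Im(a_n) x^n,$$

$$D \sum_{n=0}^\infty a_n x^n = D \sum_{n=0}^\infty \Re(a_n) x^n + i\, D \sum_{n=0}^\infty \Im(a_n) x^n,$$

$$\int_0^x \sum_{n=0}^\infty a_n t^n\, dt = \int_0^x \sum_{n=0}^\infty \Re(a_n) t^n\, dt + i \int_0^x \sum_{n=0}^\infty \Im(a_n) t^n\, dt,$$

for all $x \in \mathbb{R}$, $|x| < \rho$.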

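7.24 Example. Denote by $\mathrm{Si} : [0, \infty[ \to \mathbb{R}$ the function

$$\mathrm{Si}(x) := \int_0^x \frac{\sin t}{t}\, dt.$$

By Theorem 7.21 we find

$$\mathrm{Si}(x) = \int_0^x \frac{\sin t}{t}\, dt = \int_0^x \sum_{n=0}^\infty (-1)^n \frac{t^{2n}}{(2n+1)!}\, dt = \sum_{n=0}^\infty (-1)^n \frac{x^{2n+1}}{(2n+1)(2n+1)!}, \qquad x \ge 0,$$

and the following error estimate in the approximation $\mathrm{Si}(x) \simeq \sum_{n=0}^{p-1} (-1)^n \frac{x^{2n+1}}{(2n+1)(2n+1)!}$
holds,

$$\Big| \mathrm{Si}(x) - \sum_{n=0}^{p-1} (-1)^n \frac{x^{2n+1}}{(2n+1)(2n+1)!} \Big| \le \frac{x^{2p+1}}{(2p+1)(2p+1)!}.$$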
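
The series for $\mathrm{Si}(x)$ and an error bound of the form $x^{2p+1}/((2p+1)(2p+1)!)$ for the sum of the first $p$ terms can be tried out numerically. In this sketch a partial sum with many terms is used as a stand-in for the exact value of $\mathrm{Si}(2)$, which is an assumption of the illustration rather than an independent evaluation of the integral.

```python
import math

def si_partial(x, p):
    """Sum of the first p terms (n = 0, ..., p - 1) of the series for Si(x)."""
    return sum((-1) ** n * x ** (2 * n + 1) / ((2 * n + 1) * math.factorial(2 * n + 1))
               for n in range(p))

def si_bound(x, p):
    """The error bound x^(2p+1) / ((2p+1)(2p+1)!)."""
    return x ** (2 * p + 1) / ((2 * p + 1) * math.factorial(2 * p + 1))

x = 2.0
reference = si_partial(x, 30)      # many terms, used here as a stand-in for Si(2)
for p in (1, 2, 4, 6):
    print(p, abs(reference - si_partial(x, p)), si_bound(x, p))
```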



c. Power series and Taylor series 

From (7.7) we infer the following.

7.25 Theorem. Let $\sum_{n=0}^\infty a_n x^n$ be a power series with positive radius of
convergence. Then it is the Taylor series of its sum $S(x) := \sum_{n=0}^\infty a_n x^n$,
that is,

$$a_n = \frac{D^n S(0)}{n!} \qquad \forall n \ge 0. \qquad (7.8)$$

Theorem 7.25 expresses the rigidity of the sums of power series: it
suffices to know $S(x)$ in a small interval $]-\delta, \delta[$ of the real axis to know
its derivatives at 0. By Theorem 7.25 all coefficients $a_n$ are then identified,
and in turn the sum $S(x)$ on the entire domain of convergence. Explicit
formulas exist in some cases, but we do not pursue this point which is
nowadays part of the theory of functions of complex variables.

Another immediate consequence is the following.
7.26 Theorem (Principle of identity of power series). Suppose that
$\sum_{n=0}^\infty a_n x^n$ and $\sum_{n=0}^\infty b_n x^n$ converge in $]-\rho, \rho[$, $\rho > 0$, and that
$\sum_{n=0}^\infty a_n x^n = \sum_{n=0}^\infty b_n x^n$ on $]-\rho, \rho[$. Then $a_n = b_n$ for all $n$.



d. Complex series 

The theorem of differentiation and integration of series extends to complex
power series provided suitable definitions of “complex derivative” and of
“integral of a complex function” are given. We do not give here fully general
definitions, as it would lead us into the theory of functions of complex
variables. Here we make only a few remarks. Let $f : B \subset \mathbb{C} \to \mathbb{C}$ be a
function defined on an open ball $B$ centered at zero in the complex plane,
$B := \{z \mid |z| < \rho\}$. We say that $f$ has complex derivative at $z_0 \in B$ if the
following limit exists in $\mathbb{C}$,

$$\lim_{z \to z_0} \frac{f(z) - f(z_0)}{z - z_0} =: Df(z_0).$$

We define the integral of $f$ from 0 to $z = x + iy \in B$ as

$$\int_0^z f(w)\, dw := \int_0^x f(t + i0)\, dt + i \int_0^y f(x + it)\, dt.$$



7.27 Remark. Notice the following:

(i) We have

$$D z^n = n z^{n-1}, \qquad \int_0^z w^n\, dw = \frac{z^{n+1}}{n+1} \qquad \forall z \in \mathbb{C},$$

as one can see by a direct computation.

(ii) Because of the fundamental theorem of calculus, if $f : B \to \mathbb{C}$ has a
complex derivative that is continuous, then

$$f(z) - f(0) = \int_0^z Df(w)\, dw \qquad (7.9)$$

as one can check from the definition.

(iii) From (ii) it follows: if $f$ has a continuous complex derivative on $B$
and $Df(z) = 0$ $\forall z \in B$, then $f$ is constant on $B$.

(iv) If $g : B \to \mathbb{C}$ has a complex derivative, then the function $f : B \to \mathbb{C}$
defined by

$$f(z) := \int_0^z g(w)\, dw$$

has a complex derivative on $B$ and $Df(z) = g(z)$. This is actually a
key point of the theory of functions of a complex variable.
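
The integral of a complex function from 0 to $z = x + iy$ along the real segment $[0, x]$ followed by the vertical segment from $x$ to $x + iy$ can be approximated numerically and compared with $z^{n+1}/(n+1)$ for $f(w) = w^n$. The sketch uses the midpoint rule on each segment; the rule and the number of steps are illustrative choices, and `path_integral` is a hypothetical helper name.

```python
def path_integral(f, z, steps=2000):
    """Midpoint-rule approximation of the integral of f from 0 to z = x + iy,
    along the real segment [0, x] followed by the vertical segment from x to x + iy."""
    x, y = z.real, z.imag
    horizontal = sum(f(complex((k + 0.5) * x / steps, 0.0)) for k in range(steps)) * (x / steps)
    vertical = sum(f(complex(x, (k + 0.5) * y / steps)) for k in range(steps)) * (y / steps)
    return horizontal + 1j * vertical    # dw = i dt on the vertical segment

z, n = 0.4 + 0.3j, 3
approx = path_integral(lambda w: w ** n, z)
exact = z ** (n + 1) / (n + 1)
print(approx, exact)
```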

With these definitions Theorem 7.21 extends to the following. 
7.28 Theorem (Differentiation and integration of power series).
Let $\sum_{n=0}^\infty a_n z^n$ converge with a positive radius of convergence $\rho > 0$ and
let $S(z)$ be its sum, $S(z) := \sum_{n=0}^\infty a_n z^n$. Then

(i) $S(z)$ has complex derivatives of any order in $\{|z| < \rho\}$ and

$$D^k S(z) = \sum_{n=k}^\infty n(n-1)\cdots(n-k+1)\, a_n z^{n-k}, \qquad |z| < \rho.$$

(ii)

$$\int_0^z S(w)\, dw = \sum_{n=0}^\infty a_n \frac{z^{n+1}}{n+1}, \qquad |z| < \rho.$$



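Proof. Assuming (iv) of Remark 7.27, one can repeat the proof of
Theorem 7.21. Here we present a more direct proof.

(i) Fix $z$ such that $|z| < \rho$ and let $\delta = \delta(z)$ be such that $|z| < \delta < \rho$.
We notice that for any $h$ with $0 < |h| < \delta - |z|$, we have

$$\frac{S(z+h) - S(z)}{h} = \frac{1}{h} \Big( \sum_{n=0}^\infty a_n (z+h)^n - \sum_{n=0}^\infty a_n z^n \Big)$$
$$= \frac{1}{h} \sum_{n=1}^\infty a_n \Big( \sum_{k=0}^{n} \binom{n}{k} z^k h^{n-k} - z^n \Big)$$
$$= \sum_{n=1}^\infty a_n \sum_{k=0}^{n-1} \binom{n}{k} z^k h^{n-k-1}$$
$$= \sum_{n=1}^\infty n a_n z^{n-1} + \sum_{n=2}^\infty a_n \Big( \sum_{k=0}^{n-2} \binom{n}{k} z^k h^{n-k-1} \Big),$$

that is,

$$\Big| \frac{S(z+h) - S(z)}{h} - \sum_{n=1}^\infty n a_n z^{n-1} \Big| \le |\omega(z, h)| \qquad (7.10)$$

where $\omega(z, h) := \sum_{n=2}^\infty a_n \big( \sum_{k=0}^{n-2} \binom{n}{k} z^k h^{n-k-1} \big)$. We then estimate
$\omega(z, h)$ by

$$|\omega(z, h)| \le \sum_{n=2}^\infty |a_n| \Big( \sum_{k=0}^{n-2} \binom{n}{k} |z|^k |h|^{n-k-1} \Big)$$
$$\le |h| \sum_{n=2}^\infty n(n-1) |a_n| \Big( \sum_{k=0}^{n-2} \binom{n-2}{k} |z|^k |h|^{n-2-k} \Big) \qquad (7.11)$$
$$= |h| \sum_{n=2}^\infty n(n-1) |a_n| (|z| + |h|)^{n-2} \le |h| \sum_{n=2}^\infty n(n-1) |a_n| \delta^{n-2} = C\, |h|, \qquad (7.12)$$

where $C \in \mathbb{R}$ is the sum of $\sum_{n=2}^\infty n(n-1) |a_n| \delta^{n-2}$, which converges and does
not depend on $h$. In conclusion, (7.10), (7.11) and (7.12) yield

$$\frac{S(z+h) - S(z)}{h} \to \sum_{n=1}^\infty n a_n z^{n-1} \qquad \text{as } h \to 0.$$

Since $z$ has been chosen arbitrarily in the interior
of the disc of convergence, we then conclude that $S$ is differentiable at each
point $z$ with $|z| < \rho$, and

$$DS(z) = \sum_{n=1}^\infty n a_n z^{n-1}, \qquad |z| < \rho.$$

By induction on $k$ we then infer (i).

(ii) For $|z| < \rho$ and $t \in [0, 1]$, the complex series of the real variable $t$,
$\sum_{n=0}^\infty a_n z^n t^n$, has radius of convergence larger than 1 and sum $S(tz)$.
From Remark 7.23 then

$$\int_0^z S(w)\, dw = z \int_0^1 S(tz)\, dt = z \sum_{n=0}^\infty \frac{a_n z^n}{n+1} = \sum_{n=0}^\infty a_n \frac{z^{n+1}}{n+1}.$$

□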



7.2 Further Results 

7.2.1 Boundary values 

Let $\sum_{n=0}^\infty a_n z^n$ be a power series with radius of convergence that we
assume for the sake of simplicity to be 1. As we have seen, if the series
converges absolutely at some boundary point $z$, $|z| = 1$, then it converges
absolutely at every boundary point.

In some cases the following test is useful.
7.29 Proposition (Absolute convergence at the boundary). Let
$\sum_{n=0}^\infty a_n z^n$ be a power series with a positive radius of convergence $\rho > 0$
and sum $S(z)$. Suppose that for some $z_0 \in \mathbb{C}$ with $|z_0| = \rho$ and for each
integer $n$, $a_n z_0^n$ is a nonnegative real number and that $\limsup_{r \to 1^-} S(r z_0) < +\infty$.
Then $\sum_{n=0}^\infty a_n z^n$ converges absolutely at all $z$ such that $|z| = \rho$.

Proof. In fact, if $|z| = |z_0| = \rho$, for any $p \ge 0$ and $0 < r < 1$ we have
$\sum_{n=0}^p a_n z_0^n r^n \le \sum_{n=0}^\infty a_n z_0^n r^n$, hence

$$\sum_{n=0}^p |a_n|\,|z|^n = \sum_{n=0}^p |a_n|\,|z_0|^n = \sum_{n=0}^p a_n z_0^n
  \le \limsup_{r \to 1^-} \sum_{n=0}^\infty a_n (r z_0)^n = \limsup_{r \to 1^-} S(r z_0) < +\infty.$$

□

Let $\sum_{n=0}^\infty a_n z^n$ be a power series with radius of convergence 1. If the
series $\sum_{n=0}^\infty a_n z^n$ does not converge at every boundary point $|z| = 1$, then
at those points at which it converges it does not converge absolutely. The
following theorem deals with one such case.

7.30 Theorem (Dirichlet). Let $\sum_{n=0}^\infty a_n z^n$ be a series with radius of
convergence 1. Suppose that the sequence $\{a_n\}$ converges to zero and has
bounded total variation,

$$\sum_{n=0}^\infty |a_{n+1} - a_n| < \infty.$$

Then $\sum_{n=0}^\infty a_n z^n$ converges for all $z$ with $|z| = 1$ and $z \ne 1$ and

$$\Big| \sum_{n=p}^\infty a_n z^n \Big| \le \frac{4}{|1 - z|} \sum_{n=p}^\infty |a_{n+1} - a_n|; \qquad (7.13)$$

in particular $\sum_{n=0}^\infty a_n z^n$ converges uniformly on every domain $D_\rho :=
\{z \mid |z| \le 1,\ |1 - z| \ge \rho\}$ for all $\rho > 0$. If moreover $S(z)$ is its sum, then
$S(z)$ is continuous on $\{z \mid |z| \le 1,\ z \ne 1\}$ and

$$(1 - z)\, S(z) \to 0 \qquad \text{as } z \to 1,\ |z| \le 1.$$

Proof. For $|z| \le 1$, $z \ne 1$, we have

$$\Big| \sum_{j=0}^n z^j \Big| = \frac{|1 - z^{n+1}|}{|1 - z|} \le \frac{2}{|1 - z|},$$

therefore by Dirichlet's test, Theorem 6.55, $\sum_{n=0}^\infty a_n z^n$ converges and
(7.13) holds. We therefore infer from (7.13) the uniform convergence of
$\sum_{n=0}^\infty a_n z^n$ on $D_\rho$. This proves the first part of the theorem, and the
continuity of the sum $S(z)$ on $\{z \mid |z| \le 1,\ z \ne 1\}$.

To prove the second part, for $\epsilon > 0$ choose $n$ in such a way that
$\sum_{j=n+1}^\infty |a_{j+1} - a_j| < \epsilon$. If $S_n(z)$ denotes the $n$-th partial sum, by (7.13)
we have for $|z| \le 1$, $z \ne 1$,

$$|(1 - z)\, S(z)| \le |(1 - z)\, S_n(z)| + |(1 - z)\,(S(z) - S_n(z))|$$
$$= |(1 - z)\, S_n(z)| + \Big| (1 - z) \sum_{j=n+1}^\infty a_j z^j \Big|$$
$$\le |(1 - z)\, S_n(z)| + 4\epsilon.$$

Since $(1 - z) S_n(z)$ is a polynomial, hence a continuous function, that vanishes
at $z = 1$, there exists $\delta > 0$ such that $|(1 - z)\, S_n(z)| < \epsilon$ for $|z - 1| < \delta$, hence we
conclude for $|z - 1| < \delta$ and $|z| \le 1$ that $|(1 - z) S(z)| < 5\epsilon$. □
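
An estimate of Dirichlet type, $|\sum_{n \ge p} a_n z^n| \le \frac{4}{|1-z|} \sum_{n \ge p} |a_{n+1} - a_n|$, can be probed numerically at a boundary point. The sketch assumes $a_n = 1/(n+1)$, which decreases to zero so that the variation from $p$ onwards telescopes to $a_p$, and it uses a long finite sum as a stand-in for the infinite tail; the angle 0.7 and the cutoff 5000 are arbitrary.

```python
import cmath

def tail_sum(p, q, z):
    """sum_{n=p}^{q} a_n z^n with a_n = 1/(n + 1)."""
    return sum(z ** n / (n + 1) for n in range(p, q + 1))

def bound(p, z):
    """4/|1 - z| times the variation of a_n from p on, which telescopes to a_p = 1/(p + 1)."""
    return 4.0 / abs(1 - z) * (1.0 / (p + 1))

z = cmath.exp(1j * 0.7)            # a boundary point, |z| = 1, z != 1
for p in (1, 10, 100):
    print(p, abs(tail_sum(p, 5000, z)), bound(p, z))
```

The bound degenerates as $z \to 1$, in agreement with the fact that the series with these coefficients diverges at $z = 1$.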

Suppose that $\sum_{n=0}^\infty a_n z^n$ converges at a point $z_0$ of the boundary
$\{z \mid |z| = 1\}$ of its disc of convergence; is its sum continuous at $z_0$? Of
course the answer is yes if $\sum_{n=0}^\infty a_n z_0^n$ converges absolutely; in this case,
in fact, it converges uniformly in $\{z \mid |z| \le 1\}$ by Theorems 7.14 and 7.13.

The next theorem gives a partial answer in the general case.

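7.31 Theorem (Abel). Suppose $\sum_{n=0}^\infty a_n z^n$ has radius of convergence
1 and converges at $z_0$ with $|z_0| = 1$. Then the series of real powers with
complex coefficients $\sum_{n=0}^\infty a_n z_0^n t^n$ converges uniformly on $[0, 1]$, and its
sum $s(t) := S(t z_0)$, $S(z)$ being the sum of $\sum_{n=0}^\infty a_n z^n$, is continuous on
$[0, 1]$.

Proof. Set $\alpha_n := t^n$, $\beta_n := a_n z_0^n$, $B_n := \sum_{j=0}^n \beta_j$. By assumption
$\sum_{n=0}^\infty \beta_n$ converges and

$$\sum_{n=0}^\infty |\alpha_{n+1} - \alpha_n| = \sum_{n=0}^\infty (t^n - t^{n+1}) = \begin{cases} 1 & \text{if } 0 \le t < 1, \\ 0 & \text{if } t = 1, \end{cases}$$

since $t \in [0, 1]$. By Abel's test, Theorem 6.57, applied with the sequences
$\{\alpha_n\}$ and $\{\beta_n\}$, we get

$$\Big| \sum_{j=n+1}^\infty a_j z_0^j t^j \Big| = \Big| \sum_{j=n+1}^\infty \alpha_j \beta_j \Big| \le 3 \sup_{j \ge n+1} |B_j - B_n|,$$

where $B_p := \sum_{j=0}^p \beta_j$. On the other hand, given $\epsilon > 0$, there exists $\bar n$ such
that $|B_p - B_n| < \epsilon$ for $n, p \ge \bar n$, hence

$$\Big| \sum_{j=n+1}^\infty a_j z_0^j t^j \Big| = \Big| \sum_{j=n+1}^\infty \alpha_j \beta_j \Big| \le 3\epsilon \qquad \forall n \ge \bar n,\ \forall t \in [0, 1].$$

Therefore the series $\sum_{n=0}^\infty a_n z_0^n t^n$ converges uniformly in $[0, 1]$, and the
function $s(t) := \sum_{n=0}^\infty a_n z_0^n t^n$ is continuous in $[0, 1]$ by Theorem 7.13. □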
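
Uniform convergence up to the boundary point can be observed on the series $\sum_{n \ge 1} (-1)^{n-1} t^n/n = \log(1+t)$, which converges at $t = 1$ but not absolutely. The sketch below measures the largest error on a grid in $[0, 1]$ including the endpoint; the comparison value $1/(N+1)$ is the alternating-series remainder bound, an elementary estimate assumed here for illustration and not taken from the proof above.

```python
import math

def s_partial(t, n_terms):
    """sum_{n=1}^{n_terms} (-1)^(n-1) t^n / n, a partial sum of the series for log(1 + t)."""
    return sum((-1) ** (n - 1) * t ** n / n for n in range(1, n_terms + 1))

grid = [k / 200.0 for k in range(201)]          # sample points in [0, 1], endpoint included
for n_terms in (10, 100, 1000):
    sup_err = max(abs(math.log(1.0 + t) - s_partial(t, n_terms)) for t in grid)
    print(n_terms, sup_err, 1.0 / (n_terms + 1))
```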

In particular, we can state the following. 

7.32 Corollary. Let $\sum_{n=0}^\infty a_n z^n$ be a power series with radius of
convergence $\rho > 0$. If it converges at $z_0$ with $|z_0| = \rho$, then its sum is continuous
when restricted to the closed segment joining 0 with $z_0$. Moreover we have

$$\int_0^{z_0} S(w)\, dw = z_0 \int_0^1 S(t z_0)\, dt = \sum_{n=0}^\infty a_n \frac{z_0^{n+1}}{n+1}. \qquad (7.15)$$

As a consequence we can prove the claim (i) in Remark 6.61. 

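7.33 Theorem (Abel). If $\sum_{n=0}^\infty a_n$ and $\sum_{n=0}^\infty b_n$ converge respectively
to $A$ and $B$ and the product series $\sum_{n=0}^\infty c_n$, $c_n := \sum_{k=0}^n a_k b_{n-k}$, converges
to $C$, then $C = AB$.

Proof. Set $f(x) = \sum_{n=0}^\infty a_n x^n$, $g(x) = \sum_{n=0}^\infty b_n x^n$, $h(x) = \sum_{n=0}^\infty c_n x^n$,
$0 \le x \le 1$. Since $f(x) g(x) = h(x)$ for $0 \le x < 1$, see Theorem 6.60,
and $f(x) \to A$, $g(x) \to B$ and $h(x) \to C$ as $x \to 1^-$ by Theorem 7.31, the claim
follows. □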



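7.2.2 Product and composition of power series

An immediate consequence of Theorem 6.60 is the following.

7.34 Theorem. Let $\sum_{n=0}^\infty a_n z^n$ and $\sum_{n=0}^\infty b_n z^n$ be two power series with
respectively radii of convergence $\rho_a > 0$ and $\rho_b > 0$. Then the series
$\sum_{n=0}^\infty c_n z^n$, where $c_n := \sum_{k=0}^n a_k b_{n-k}$, has radius of convergence
$\rho_c \ge \min(\rho_a, \rho_b)$ and

$$\sum_{n=0}^\infty c_n z^n = \Big( \sum_{n=0}^\infty a_n z^n \Big) \Big( \sum_{n=0}^\infty b_n z^n \Big)$$

for all $z$, $|z| < \min(\rho_a, \rho_b)$.

Notice that $\rho_c$ is at least the minimum of $\rho_a$ and $\rho_b$, and it can be strictly
larger. For instance, if $\sum_{n=0}^\infty a_n z^n$ is the zero polynomial, then $\sum_{n=0}^\infty c_n z^n$ is
the zero polynomial, too, hence $\rho_c = \rho_a = +\infty$, no matter the size of $\rho_b$.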
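
The coefficients $c_n = \sum_{k=0}^n a_k b_{n-k}$ of a product series are easy to compute from truncated coefficient lists. The sketch below assumes both factors are the exponential series, $a_n = b_n = 1/n!$, so that the product should have the coefficients $2^n/n!$ of $e^{2z}$; `cauchy_product` is an illustrative helper name.

```python
import math

def cauchy_product(a, b):
    """Coefficients c_n = sum_{k=0}^{n} a_k b_{n-k} for n < min(len(a), len(b))."""
    size = min(len(a), len(b))
    return [sum(a[k] * b[n - k] for k in range(n + 1)) for n in range(size)]

# Both factors are the exponential series, a_n = b_n = 1/n!; the product should be exp(2z).
a = [1.0 / math.factorial(n) for n in range(15)]
c = cauchy_product(a, a)
expected = [2.0 ** n / math.factorial(n) for n in range(15)]
print(max(abs(x - y) for x, y in zip(c, expected)))
```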
a. Weierstrass's double series theorem

Suppose that all series

$$S_m(z) = \sum_{k=0}^\infty a_{mk} z^k, \qquad m = 0, 1, 2, \ldots,$$

are convergent at least for $|z| < R$, $R > 0$, and that for every $\rho < R$

$$S(z) := \sum_{m=0}^\infty S_m(z)$$

is uniformly convergent on $|z| \le \rho$. Then we have

7.35 Theorem. The coefficients of the power series of $z$ in the respective
series form a convergent series and if we set

$$A_k := \sum_{m=0}^\infty a_{mk},$$

the series $\sum_{k=0}^\infty A_k z^k$ sums to $S(z)$ at least for $|z| < R$. Moreover $S(z)$ is
infinitely differentiable and

$$\frac{D^n S(z)}{n!} = \sum_{m=0}^\infty \frac{D^n S_m(z)}{n!}.$$

This follows at once from Proposition 6.66.



7.2.3 Taylor series: examples 

Taylor series of given smooth functions (see Section 6.1) are important 
examples of power series on which one can test the theory. 



7.36 Example. We proved in Example 6.15 that

$$\sum_{n=0}^\infty \frac{x^n}{n!} = e^x, \qquad \sum_{n=0}^\infty (-1)^n \frac{x^{2n+1}}{(2n+1)!} = \sin x, \qquad \sum_{n=0}^\infty (-1)^n \frac{x^{2n}}{(2n)!} = \cos x$$

for all $x \in \mathbb{R}$, thus the radius of convergence of all these series is $\infty$. Convergence to
their respective sum is uniform on every bounded interval.



7.37 Example (Geometric series). We saw at several places that the geometric
series $\sum_{n=0}^\infty x^n$ converges if and only if $|x| < 1$, and sums to

$$\sum_{n=0}^\infty x^n = \frac{1}{1 - x}, \qquad |x| < 1.$$

The convergence is uniform on every interval $[-r, r]$ with $r < 1$. Observe that the
convergence is not uniform, neither in the open interval $]-1, 1[$ nor in the interval
$]-1, 0]$ in which the sum is bounded, since the general term satisfies

$$\sup_{x \in ]-1, 0]} |x^n| = 1$$

for every $n$ and therefore does not tend to zero uniformly.


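7.38 Example (Logarithm). Replacing $x$ by $-x$ in

$$\frac{1}{1 - x} = \sum_{n=0}^\infty x^n, \qquad |x| < 1,$$

we get

$$\frac{1}{1 + x} = \sum_{n=0}^\infty (-1)^n x^n, \qquad |x| < 1,$$

and integrating,

$$\log(1 + x) = \int_0^x \frac{1}{1 + t}\, dt = \int_0^x \sum_{n=0}^\infty (-1)^n t^n\, dt = \sum_{n=0}^\infty (-1)^n \frac{x^{n+1}}{n+1}, \qquad |x| < 1, \qquad (7.16)$$

an equality that we already know by an ad hoc computation (see Example 6.13). In fact
there we proved that $\sum_{n=0}^\infty (-1)^n \frac{x^{n+1}}{n+1}$ converges if $-1 < x \le 1$, does not converge if
$x = -1$, and the equality (7.16) holds if $-1 < x \le 1$.

On the other hand, the radius of convergence of $\sum_{n=0}^\infty (-1)^n \frac{x^{n+1}}{n+1}$ is 1 since $1/\rho =
\limsup_{n\to\infty} \sqrt[n]{1/n} = 1$. We then conclude that $\sum_{n=0}^\infty (-1)^n \frac{x^{n+1}}{n+1}$ converges if and only
if $-1 < x \le 1$ and

$$\sum_{n=0}^\infty (-1)^n \frac{x^{n+1}}{n+1} = \log(1 + x), \qquad -1 < x \le 1.$$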



7.39 Example (Arc tangent). Replacing $x$ by $-x^2$ in

$$\frac{1}{1 - x} = \sum_{n=0}^\infty x^n, \qquad |x| < 1,$$

and integrating according to Theorem 7.19 we get

$$\arctan x = \int_0^x \frac{1}{1 + t^2}\, dt = \int_0^x \sum_{n=0}^\infty (-1)^n t^{2n}\, dt = \sum_{n=0}^\infty (-1)^n \frac{x^{2n+1}}{2n+1}, \qquad |x| < 1, \qquad (7.17)$$

see Example 6.14. There we proved the convergence of the series and the equality (7.17)
also for $x = \pm 1$.

We can prove the result at $x = \pm 1$ by means of the theory. In fact, if $x = \pm 1$ the
series reduces to $\pm \sum_{n=0}^\infty (-1)^n \frac{1}{2n+1}$, which converges by the Leibniz test. Then Abel's
theorem yields continuity of the sum of the series at $\pm 1$, thus, passing to the limit in
both sides of (7.17) as $x \to \pm 1$, we infer

$$\arctan(\pm 1) = \sum_{n=0}^\infty (-1)^n \frac{(\pm 1)^{2n+1}}{2n+1}.$$

On the other hand, since $|x|^{2n+1}/(2n+1) \to +\infty$ if $|x| > 1$, we conclude that
$\sum_{n=0}^\infty (-1)^n \frac{x^{2n+1}}{2n+1}$ converges if and only if $|x| \le 1$, and

$$\sum_{n=0}^\infty (-1)^n \frac{x^{2n+1}}{2n+1} = \arctan x, \qquad |x| \le 1.$$
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7.40 Example (The binomial series). We claim that 

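$$(1+x)^\alpha = \sum_{n=0}^\infty \binom{\alpha}{n} x^n, \qquad \alpha \in \mathbb{R},\ |x| < 1, \tag{7.18}$$

where

$$\binom{\alpha}{n} := \begin{cases} 1 & \text{if } n = 0, \\ \dfrac{\alpha(\alpha-1)(\alpha-2)\cdots(\alpha-n+1)}{n!} & \text{if } n \ge 1. \end{cases}$$

Notice that $D^n(1+x)^\alpha = \alpha(\alpha-1)\cdots(\alpha-n+1)(1+x)^{\alpha-n}$, hence the series in (7.18)
is the Taylor series of $(1+x)^\alpha$ centered at zero.

Set $a_n := \binom{\alpha}{n}$. Since

$$\frac{|a_{n+1}|}{|a_n|} = \frac{|\alpha - n|}{n+1} \to 1 \qquad \text{as } n \to \infty,$$

we infer, see Example 2.57, $\sqrt[n]{|a_n|} \to 1$; therefore the series in (7.18) has radius of
convergence 1.

Let $S(x) := \sum_{n=0}^\infty \binom{\alpha}{n} x^n$, $|x| < 1$; differentiating we then find, similarly to 5.53
of [GM1],

$$(1+x)S'(x) = \alpha S(x), \qquad |x| < 1,$$

hence

$$\Big(\frac{S(x)}{(1+x)^\alpha}\Big)' = \frac{(1+x)S'(x) - \alpha S(x)}{(1+x)^{\alpha+1}} = 0.$$

Therefore we conclude that $S(x) = c(1+x)^\alpha$ for $|x| < 1$, $c$ being a constant; finally
from $S(0) = 1$ we infer $c = 1$.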
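
As a numerical sanity check of the binomial series, the following minimal Python sketch sums finitely many terms and compares the result with $(1+x)^\alpha$. The function names and the number of terms are illustrative choices, not part of the text, and floating point arithmetic is assumed to be accurate enough for the chosen values.

```python
import math


def binom_general(alpha, n):
    """Generalized binomial coefficient alpha(alpha-1)...(alpha-n+1)/n!."""
    c = 1.0
    for k in range(n):
        c *= (alpha - k) / (k + 1)
    return c


def binomial_series(alpha, x, terms=200):
    """Partial sum of sum_n binom(alpha, n) x^n; meaningful only for |x| < 1."""
    return sum(binom_general(alpha, n) * x**n for n in range(terms))


# Compare the partial sum with the closed form (1 + x)^alpha.
print(binomial_series(0.5, 0.3), math.sqrt(1.3))
```

For $|x|$ close to 1 the convergence is slow, so more terms would be needed there.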

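7.41 Example (The arc sine). Replacing $x$ with $-x^2$ and choosing $\alpha = -1/2$ in
(7.18) we get

$$\frac{1}{\sqrt{1-x^2}} = \sum_{n=0}^\infty \binom{-1/2}{n} (-1)^n x^{2n} = \sum_{n=0}^\infty \frac{(2n-1)!!}{(2n)!!}\, x^{2n}$$

and, integrating, we get

$$\arcsin x = \sum_{n=0}^\infty \binom{-1/2}{n} (-1)^n \frac{x^{2n+1}}{2n+1} = \sum_{n=0}^\infty \frac{(2n-1)!!}{(2n)!!}\, \frac{x^{2n+1}}{2n+1} \tag{7.19}$$

for $|x| < 1$ (with the convention $(-1)!! = 0!! = 1$), and the series in (7.19) has radius of convergence 1. The series in (7.19)
actually converges absolutely if $|z| = 1$, hence uniformly in $\{z \mid |z| \le 1\}$. In fact, the
coefficients of the series in (7.19), that we denote by $\{c_n\}$, are nonnegative, hence for
all $p \ge 1$,

$$\sum_{n=0}^p |c_n| = \sum_{n=0}^p c_n = \lim_{r\to 1^-} \sum_{n=0}^p c_n r^{2n+1} \le \lim_{r\to 1^-} \sum_{n=0}^\infty c_n r^{2n+1} = \lim_{r\to 1^-} \arcsin r = \frac{\pi}{2}.$$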



7.42 Example. Similarly, choosing in (7.18) $\alpha = 1/2$, we get

$$\sum_{n=0}^\infty \binom{1/2}{n} (-1)^n z^n = \sqrt{1-z}$$

and the series has radius of convergence 1. Actually, $\sum_{n=0}^\infty c_n z^n$, $c_n := \binom{1/2}{n}(-1)^n$,
converges absolutely if $|z| = 1$, hence uniformly in $\{z \mid |z| \le 1\}$. We in fact have $c_n \le 0$
$\forall n \ge 1$, hence, for $p \ge 1$,

$$\sum_{n=0}^p |c_n| = 1 - \sum_{n=1}^p c_n = 2 - \lim_{r\to 1^-} \sum_{n=0}^p c_n r^n \le 2 - \lim_{r\to 1^-} \sum_{n=0}^\infty \binom{1/2}{n} (-r)^n = \lim_{r\to 1^-} \big(2 - \sqrt{1-r}\big) = 2.$$



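7.43 Example. The series $\sum_{n=0}^\infty n^2 z^n$ has radius of convergence 1. Writing $n^2 z^n =
n(n-1)z^n + nz^n = z^2\, n(n-1)z^{n-2} + z\, nz^{n-1}$, and summing, we get, for $|z| < 1$,

$$\sum_{n=0}^\infty n^2 z^n = z^2 D^2\Big(\sum_{n=0}^\infty z^n\Big) + z D\Big(\sum_{n=0}^\infty z^n\Big) = \frac{2z^2}{(1-z)^3} + \frac{z}{(1-z)^2} = \frac{z(1+z)}{(1-z)^3}.$$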



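7.44 Example. We compute

$$\sum_{n=2}^\infty \frac{n+2}{n-1}\, z^n.$$

Writing

$$\frac{n+2}{n-1} = 1 + \frac{3}{n-1},$$

multiplying by $z^n$ and summing we get, for $|z| < 1$,

$$\sum_{n=2}^\infty \frac{n+2}{n-1}\, z^n = \sum_{n=2}^\infty z^n + 3z \sum_{n=2}^\infty \frac{z^{n-1}}{n-1} = \frac{z^2}{1-z} - 3z \log(1-z)$$

by using the identities

$$\sum_{n=0}^\infty z^n = \frac{1}{1-z}, \qquad \sum_{n=0}^\infty \frac{z^{n+1}}{n+1} = -\log(1-z).$$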



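7.45 Example. We compute

$$\sum_{n=3}^\infty \frac{z^n}{(n+1)(n-2)}.$$

Writing

$$\frac{1}{(n+1)(n-2)} = -\frac{1}{3}\,\frac{1}{n+1} + \frac{1}{3}\,\frac{1}{n-2},$$

multiplying by $z^n$ and summing, we get, for $|z| < 1$, $z \ne 0$,

$$\sum_{n=3}^\infty \frac{z^n}{(n+1)(n-2)} = -\frac{1}{3z} \sum_{n=3}^\infty \frac{z^{n+1}}{n+1} + \frac{z^2}{3} \sum_{n=3}^\infty \frac{z^{n-2}}{n-2}
= -\frac{1}{3z}\Big(-\log(1-z) - z - \frac{z^2}{2} - \frac{z^3}{3}\Big) - \frac{z^2}{3}\log(1-z).$$

Here we used that $\sum_{n=0}^\infty \frac{z^{n+1}}{n+1} = -\log(1-z)$ for $|z| < 1$.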



7.3 Some Applications 



In this section we illustrate some applications of the theory of power series.

7.46 Example. Conversely we may express $\sum_{n=1}^\infty x^n/n^2$ as an integral. In fact for
$|x| < 1$ we have

$$\sum_{n=1}^\infty \frac{x^n}{n^2} = \sum_{n=1}^\infty \frac{1}{n} \int_0^x t^{n-1}\,dt = \int_0^x \sum_{n=1}^\infty \frac{t^{n-1}}{n}\,dt = \int_0^x \frac{1}{t} \sum_{n=1}^\infty \frac{t^n}{n}\,dt = -\int_0^x \frac{\log(1-t)}{t}\,dt.$$

This is easily justified since all series in the previous formulas have radius of convergence
1 and therefore converge uniformly on $[0,x]$ (respectively $[x,0]$ if $x < 0$) if $|x| < 1$.



7.3.1 Complex functions 

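7.47 Complex exponential. We defined in (4.6) the complex exponential by

$$e^z := e^x(\cos y + i \sin y), \qquad z =: x + iy \in \mathbb{C}.$$

Proposition. We have

$$e^z = \sum_{n=0}^\infty \frac{z^n}{n!}, \qquad z \in \mathbb{C}. \tag{7.20}$$

Proof. Since the series of $\sin y$ and $\cos y$ converge in $\mathbb{R}$, we get

$$e^{iy} = \cos y + i \sin y = \sum_{n=0}^\infty (-1)^n \frac{y^{2n}}{(2n)!} + i \sum_{n=0}^\infty (-1)^n \frac{y^{2n+1}}{(2n+1)!}
= \sum_{n=0}^\infty \frac{(iy)^{2n}}{(2n)!} + \sum_{n=0}^\infty \frac{(iy)^{2n+1}}{(2n+1)!} = \sum_{n=0}^\infty \frac{(iy)^n}{n!},$$

i.e., our claim for $z = iy$, $y \in \mathbb{R}$. Therefore by Theorem 6.60 we infer

$$e^{x+iy} = e^x e^{iy} = \sum_{n=0}^\infty \frac{x^n}{n!} \sum_{n=0}^\infty \frac{(iy)^n}{n!} = \sum_{n=0}^\infty c_n,$$

where

$$c_n = \sum_{k=0}^n \frac{x^k}{k!}\, \frac{(iy)^{n-k}}{(n-k)!} = \frac{1}{n!} \sum_{k=0}^n \binom{n}{k} x^k (iy)^{n-k} = \frac{(x+iy)^n}{n!}.$$

□

The complex differentiation theorem for series yields also

$$De^z = \sum_{n=0}^\infty n\, \frac{z^{n-1}}{n!} = \sum_{n=1}^\infty \frac{z^{n-1}}{(n-1)!} = e^z.$$

7.48 The complex logarithm. We recall that the principal determination
of the logarithm can be written as

$$\log z := \log|z| + i \arg z, \qquad z \ne 0,$$

where $\arg z \in [-\pi, \pi[$ and $\arg 1 = 0$.

Proposition. We have

$$\log(1+z) = \sum_{n=0}^\infty (-1)^n \frac{z^{n+1}}{n+1}, \qquad |z| < 1. \tag{7.21}$$

Proof. We observe that for $z \ne 0$,

$$e^{\log z} = e^{\log|z|} e^{i \arg z} = z$$

and the function $\log z$ is continuous on $\mathbb{C} \setminus \{z = x + iy \mid y = 0,\ x \le 0\}$. Similar
to the proof of the differentiability of the inverse of a real function (see
Theorem 4.16 of [GM1]), one sees that $\log z$ is differentiable at the points
at which it is continuous, i.e., on $\mathbb{C} \setminus \{z = x + iy \mid y = 0,\ x \le 0\}$, and

$$D \log z = \frac{1}{z}, \qquad z \in \mathbb{C} \setminus \{z = x + iy \mid y = 0,\ x \le 0\}.$$

Integrating, see Remark 7.27,

$$\log(1+z) - \log 1 = \int_0^z \frac{1}{\zeta + 1}\,d\zeta = \int_0^z \sum_{n=0}^\infty (-1)^n \zeta^n\,d\zeta = \sum_{n=0}^\infty (-1)^n \frac{z^{n+1}}{n+1}, \qquad |z| < 1.$$

□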

7.49 Show that equality (7.21) holds for all $z$ with $|z| = 1$, $z \ne -1$. [Hint: Use
Abel's theorem and the continuity of $\log(1+z)$.]

7.50 Complex trigonometric and hyperbolic functions. Starting from the complex
exponential one defines the complex sine and cosine, and the complex hyperbolic
sine and cosine by

$$\cos z := \frac{e^{iz} + e^{-iz}}{2}, \qquad \sin z := \frac{e^{iz} - e^{-iz}}{2i}, \qquad \cosh z := \frac{e^z + e^{-z}}{2}, \qquad \sinh z := \frac{e^z - e^{-z}}{2},$$

that actually means by (7.20)

$$\cos z = \sum_{n=0}^\infty (-1)^n \frac{z^{2n}}{(2n)!}, \qquad \sin z = \sum_{n=0}^\infty (-1)^n \frac{z^{2n+1}}{(2n+1)!},$$

$$\cosh z = \sum_{n=0}^\infty \frac{z^{2n}}{(2n)!}, \qquad \sinh z = \sum_{n=0}^\infty \frac{z^{2n+1}}{(2n+1)!}.$$

Trivially the restrictions of the four previous functions to the real axis agree with the
corresponding real functions. It is possible to derive several complex "trigonometric"
identities, that are formally equivalent to the real ones:

$$\begin{aligned}
\sin^2 z + \cos^2 z &= 1, & e^{iz} &= \cos z + i \sin z, \\
\sin(-z) &= -\sin z, & \cos(-z) &= \cos z, \\
\cosh^2 z - \sinh^2 z &= 1, & e^z &= \cosh z + \sinh z, \\
\sinh(-z) &= -\sinh z, & \cosh(-z) &= \cosh z, \\
\cosh(iz) &= \cos z, & \sinh(iz) &= i \sin z,
\end{aligned}$$

and

$$\begin{aligned}
\cos(z+w) &= \cos z \cos w - \sin z \sin w, \\
\sin(z+w) &= \cos z \sin w + \cos w \sin z, \\
\cosh(z+w) &= \cosh z \cosh w + \sinh z \sinh w, \\
\sinh(z+w) &= \cosh z \sinh w + \cosh w \sinh z.
\end{aligned}$$

Consequently

$$\begin{aligned}
\sin z &= 2 \sin(z/2) \cos(z/2), \\
\cos^2(z/2) &= (1 + \cos z)/2, \\
\sin w - \sin z &= 2 \cos\Big(\frac{w+z}{2}\Big) \sin\Big(\frac{w-z}{2}\Big), \\
\cos w - \cos z &= -2 \sin\Big(\frac{w+z}{2}\Big) \sin\Big(\frac{w-z}{2}\Big).
\end{aligned}$$



7.3.2 An alternate definition of π, e and of
elementary functions

In [GM1] we defined $e$, $\pi$, the exponential function and the trigonometric functions
$\sin x$ and $\cos x$ using several tricks that involve the infinitesimal calculus. Here we want
to point out that all these facts can be subsumed by the power series

$$\sum_{n=0}^\infty \frac{z^n}{n!}$$

that converges absolutely in $\mathbb{C}$. Define the complex exponential function as

$$\exp z := \sum_{n=0}^\infty \frac{z^n}{n!} \qquad \forall z \in \mathbb{C}.$$

With some work, using several theorems, including Cauchy's theorem about product of
series, and the complex differentiation of the sums of complex power series, one may
prove

(i) ADDITION THEOREM. $\exp(z + w) = (\exp z)(\exp w)$.

(ii) $|\exp z| = \exp(\Re z)$, $|\exp z| = 1$ iff $z = iy$, $y \in \mathbb{R}$,

(iii) $\exp z$ is differentiable in the complex sense infinitely many times with derivatives
of any order equal to $\exp z$.

Moreover, if $z = x + i0$ is real, then we have for $\exp x = \sum_{n=0}^\infty \frac{x^n}{n!}$,

$$D(\exp(x)) = \exp(x), \qquad \exp(0) = 1.$$



In other words the restriction of the exponential to the real axis is the exponential 
function we already know. At this point we introduce the two complex functions 

$$\cos z := \frac{\exp(iz) + \exp(-iz)}{2}, \qquad \sin z := \frac{\exp(iz) - \exp(-iz)}{2i}. \tag{7.22}$$

From (7.22) we obtain $\exp(iz) = \cos z + i \sin z$, $z \in \mathbb{C}$, hence, formally, the Euler identity,

$$\exp(it) = \cos t + i \sin t.$$

Again from (7.22), we infer $D \sin z = \cos z$, $D \cos z = -\sin z$, from which it follows that the functions
of real variable $y(x) := \sin(x + i0)$ and $z(x) := \cos(x + i0)$ are respectively solutions of the
Cauchy problems

$$\begin{cases} y'' + y = 0, \\ y(0) = 0,\ y'(0) = 1, \end{cases} \qquad \begin{cases} y'' + y = 0, \\ y(0) = 1,\ y'(0) = 0. \end{cases}$$

In other words, $x \to \sin(x + i0)$ and $x \to \cos(x + i0)$ are the trigonometric functions
that we already know. Again from (7.22), we get

$$\cos z = \sum_{n=0}^\infty (-1)^n \frac{z^{2n}}{(2n)!}, \qquad \sin z = \sum_{n=0}^\infty (-1)^n \frac{z^{2n+1}}{(2n+1)!}, \tag{7.23}$$



which furnishes the needed developments. 

Recovering the number $\pi$, and discussing the periodicity of $\sin x$ and $\cos x$, $x \in \mathbb{R}$,
from the complex exponential is a little tricky, but it can be done. One starts by proving
that $\exp : \mathbb{C} \to \mathbb{C} \setminus \{0\}$ is onto, and then one observes that $\exp : \mathbb{C} \to \mathbb{C} \setminus \{0\}$ is a
homomorphism from the additive group $\mathbb{C}$ into the multiplicative group $\mathbb{C} \setminus \{0\}$ which
is onto but not injective: its kernel is given by

$$\ker(\exp) = \{w \in \mathbb{C} \mid \exp w = 1\}.$$

At this point one can show^ that there exists a unique positive number, that we call $\pi$,
such that

$$\ker(\exp) = 2\pi i \mathbb{Z}, \quad \text{i.e.,} \quad \exp(2\pi i k) = 1 \quad \forall k \in \mathbb{Z}.$$

The addition formulas for the sine and the cosine yield the 27r-periodicity of sinx and 
cosx. 

Concluding, we may regard $\pi$ as determined by any one of the following:

zeros of $\sin z$: $k\pi$, $k \in \mathbb{Z}$,
zeros of $\cos z$: $\frac{\pi}{2} + k\pi$, $k \in \mathbb{Z}$,
periods of $\exp z$ = $\ker(\exp)$: $2k\pi i$, $k \in \mathbb{Z}$,
periods of $\sin z$ = $2\pi\mathbb{Z}$,
periods of $\cos z$ = $2\pi\mathbb{Z}$.

^ See, e.g., the paper by R. Remmert in H.-D. Ebbinghaus, H. Hermes, F. Hirzebruch,
M. Koecher, K. Mainzer, J. Neukirch, A. Prestel, R. Remmert, Numbers, Springer, New
York, 1988.

7.3.3 Series solutions of differential equations

Power series turn out to be very useful in solving ODEs. Without entering 
the question of when or if an ODE has a series solution or whether all 
solutions can be represented as a power series, we confine ourselves here 
to presenting a few examples. 

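7.51 Example. Suppose that the equation

$$y'' - y = 0$$

has a solution of the form

$$y(x) = \sum_{k=0}^\infty a_k x^k$$

with a positive radius of convergence. We therefore have

$$0 = \sum_{k=2}^\infty k(k-1) a_k x^{k-2} - \sum_{k=0}^\infty a_k x^k = \sum_{k=2}^\infty \big(k(k-1) a_k - a_{k-2}\big) x^{k-2},$$

hence, by the principle of identity of series,

$$k(k-1) a_k = a_{k-2}, \qquad k = 2, 3, 4, \dots$$

For $k$ even, $k = 2n$, $n \ge 1$, we then find

$$a_{2n} = \frac{a_{2n-2}}{2n(2n-1)}, \quad \text{i.e.,} \quad a_{2n} = \frac{a_0}{(2n)!},$$

and, for $k$ odd, $k = 2n+1$, $n \ge 1$,

$$a_{2n+1} = \frac{a_{2n-1}}{(2n+1)2n}, \quad \text{i.e.,} \quad a_{2n+1} = \frac{a_1}{(2n+1)!}.$$

Since the series converge on $\mathbb{R}$, we conclude that

$$y(x) = a_0 \sum_{n=0}^\infty \frac{x^{2n}}{(2n)!} + a_1 \sum_{n=0}^\infty \frac{x^{2n+1}}{(2n+1)!} = a_0 \cosh x + a_1 \sinh x, \qquad x \in \mathbb{R},$$

is a solution of $y'' - y = 0$.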
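
The recurrence $k(k-1)a_k = a_{k-2}$ can be run numerically. The following Python sketch (function names and the truncation length are illustrative assumptions) builds the first coefficients from given $a_0$, $a_1$ and evaluates the truncated series, which should agree with $a_0 \cosh x + a_1 \sinh x$ up to rounding.

```python
import math


def series_coefficients(a0, a1, count):
    """First `count` coefficients (count >= 2) from k(k-1) a_k = a_{k-2}, k >= 2."""
    a = [a0, a1]
    for k in range(2, count):
        a.append(a[k - 2] / (k * (k - 1)))
    return a


def evaluate(coeffs, x):
    """Value of the truncated power series sum_k coeffs[k] x^k."""
    return sum(c * x**k for k, c in enumerate(coeffs))


a = series_coefficients(2.0, -3.0, 40)
print(evaluate(a, 1.5), 2.0 * math.cosh(1.5) - 3.0 * math.sinh(1.5))
```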



7.52 Example. Similarly, for the equation

$$y'' - xy = 0,$$

assuming that the series $\sum_{k=0}^\infty a_k x^k$ with positive radius of convergence is a solution
of the ODE, we find

$$2a_2 + \sum_{k=1}^\infty \big((k+2)(k+1) a_{k+2} - a_{k-1}\big) x^k = 0,$$

i.e.,

$$a_2 = 0, \qquad (k+2)(k+1) a_{k+2} - a_{k-1} = 0, \quad k = 1, 2, 3, \dots,$$

which yield

$$a_{3n} = \frac{a_0}{2 \cdot 3 \cdot 5 \cdot 6 \cdots (3n-1)3n}, \qquad n = 1, 2, \dots,$$

$$a_{3n+1} = \frac{a_1}{3 \cdot 4 \cdot 6 \cdot 7 \cdots 3n(3n+1)}, \qquad n = 1, 2, \dots,$$

$$a_{3n+2} = 0, \qquad n = 0, 1, 2, \dots$$

We find again two series which converge for all $x \in \mathbb{R}$,

$$y_1(x) := 1 + \sum_{n=1}^\infty \frac{x^{3n}}{2 \cdot 3 \cdot 5 \cdot 6 \cdots (3n-1)3n},$$

$$y_2(x) := x + \sum_{n=1}^\infty \frac{x^{3n+1}}{3 \cdot 4 \cdot 6 \cdot 7 \cdots 3n(3n+1)},$$

and such that

$$y(x) = a_0 y_1(x) + a_1 y_2(x)$$

solves the equation for all $a_0$ and $a_1$.



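7.53 Example. Consider the equation

$$x^2 y'' + (x^2 + x) y' - y = 0$$

and suppose that the series $\sum_{k=0}^\infty a_k x^k$ has a positive radius of convergence and is a
solution of the equation. In this case

$$\begin{aligned}
0 &= \sum_{k=0}^\infty k(k-1) a_k x^k + \sum_{k=0}^\infty k a_k x^{k+1} + \sum_{k=0}^\infty k a_k x^k - \sum_{k=0}^\infty a_k x^k \\
&= \sum_{k=0}^\infty \big(k(k-1) + k - 1\big) a_k x^k + \sum_{k=0}^\infty k a_k x^{k+1} \\
&= \sum_{k=0}^\infty (k^2 - 1) a_k x^k + \sum_{k=1}^\infty (k-1) a_{k-1} x^k \\
&= -a_0 + \sum_{k=1}^\infty \big((k^2 - 1) a_k + (k-1) a_{k-1}\big) x^k,
\end{aligned}$$

hence

$$a_0 = 0, \qquad (k-1)\big((k+1) a_k + a_{k-1}\big) = 0.$$

Thus $a_1$ is arbitrary and $a_k = -a_{k-1}/(k+1)$ for $k \ge 2$, i.e., $a_k = (-1)^{k-1}\, 2a_1/(k+1)!$. In conclusion we find

$$y(x) = 2a_1 \sum_{k=1}^\infty (-1)^{k-1} \frac{x^k}{(k+1)!} = \frac{2a_1}{x}\Big(\frac{x^2}{2!} - \frac{x^3}{3!} + \frac{x^4}{4!} - \cdots\Big) = 2a_1\, \frac{e^{-x} + x - 1}{x}, \qquad x > 0,$$

i.e., in this case, we find only a one-parameter family of solutions.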
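
The one-parameter family can be checked numerically. The sketch below (illustrative names; it assumes the recurrence $a_0 = 0$, $(k+1)a_k + a_{k-1} = 0$ for $k \ge 2$ with $a_1$ free) sums the truncated series and compares it with $2a_1(e^{-x} + x - 1)/x$.

```python
import math


def coefficients(a1, count):
    """a_0 = 0, a_1 free, a_k = -a_{k-1} / (k + 1) for k >= 2."""
    a = [0.0, a1]
    for k in range(2, count):
        a.append(-a[k - 1] / (k + 1))
    return a


def series_value(a1, x, count=40):
    """Truncated power series solution evaluated at x."""
    return sum(c * x**k for k, c in enumerate(coefficients(a1, count)))


def closed_form(a1, x):
    """2 a_1 (e^{-x} + x - 1) / x, defined for x != 0."""
    return 2 * a1 * (math.exp(-x) + x - 1) / x


print(series_value(1.0, 0.7), closed_form(1.0, 0.7))
```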



7.54 Example. Consider the equation

$$x^3 y'' + y = 0.$$

Assuming $\sum_{k=0}^\infty a_k x^k$ is a solution of the equation with a positive radius of convergence
we find

$$a_0 + \sum_{k=1}^\infty \big(a_k + (k-1)(k-2) a_{k-1}\big) x^k = 0,$$

hence all $a_k$ must vanish: the unique solution that is representable by a power series
with center 0 is the zero solution.



7.55 ¶. All ODEs in this section can be integrated explicitly by writing a first integral,
i.e., multiplying by y' and obtaining a linear first order ODE for z{x) := y'{x). Of course 
not all second order equations can be integrated this way. 







7.3.4 Generating functions and combinatorics 

a. Generating functions 

An extremely useful representation of a sequence $\{a_n\}$, $n \ge 0$, that grows
less than exponentially is given by the sum of the real or complex power
series $\sum_{n=0}^\infty a_n z^n$. In fact, the series has positive radius of
convergence and the sum $A(z) := \sum_{n=0}^\infty a_n z^n$ uniquely identifies its
coefficients, see Theorem 7.25. The function $A(z)$ is called the generating
function of the sequence $\{a_n\}$.

If we restrict ourselves to the set of bounded sequences $\{a_n\}$, denoted
by $\ell^\infty(\mathbb{C})$, the corresponding series $\sum_{n=0}^\infty a_n z^n$ converges to a function
defined at least in $\{z \mid |z| < 1\}$. Denoting by $\mathcal{C}$ the set of all maps $a :
\{z \mid |z| < 1\} \to \mathbb{C}$ that are infinitely differentiable in the complex sense,
we then establish a map $T : \ell^\infty(\mathbb{C}) \to \mathcal{C}$, which transforms every bounded
sequence $a = \{a_n\}$ into the sum of the corresponding power series

$$T\{a\}(z) := \sum_{n=0}^\infty a_n z^n.$$

Since $T$ is injective (see, for example, Theorem 7.25), though not surjective,
see Example 6.12, $T\{a\}(z)$ gives a different view of the sequence $\{a_n\}$.
We have

(i) $T$ is linear, i.e., if $\lambda, \mu \in \mathbb{C}$ and $a = \{a_n\}$, $b = \{b_n\} \in \ell^\infty(\mathbb{C})$, then
$\lambda a + \mu b := \{\lambda a_n + \mu b_n\} \in \ell^\infty(\mathbb{C})$ and

$$T\{\lambda a + \mu b\}(z) = \lambda T\{a\}(z) + \mu T\{b\}(z), \qquad |z| < 1.$$

(ii) If $e_k := (0, \dots, 0, 1, 0, 0, \dots)$, with $k$ zeros before the 1, then $T\{e_k\}(z) = z^k$.

(iii) If $a = \{a_n\}$, and

$$b := (0, \dots, 0, a_0, a_1, a_2, \dots), \quad \text{with } k \text{ leading zeros},$$

is the forward shift of $k$ places, then

$$T\{b\}(z) = \sum_{n=k}^\infty a_{n-k} z^n = z^k\, T\{a\}(z), \qquad |z| < 1.$$

(iv) If $a = \{a_n\}$, and $b = \{a_{n+k}\}_n$ is the backward shift of $k$ places, then

$$T\{b\}(z) = \sum_{n=0}^\infty a_{n+k} z^n = \frac{1}{z^k}\big(T\{a\}(z) - a_0 - a_1 z - a_2 z^2 - \cdots - a_{k-1} z^{k-1}\big).$$

(v) $T$ transforms the convolution product of sequences into the product
of the transformed functions, see Theorem 7.34,

$$T\{a * b\}(z) = \sum_{n=0}^\infty (a * b)_n z^n = \sum_{n=0}^\infty a_n z^n \sum_{n=0}^\infty b_n z^n = T\{a\}(z)\, T\{b\}(z).$$

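7.56 Example. The generating function of the sequence $(1, 1, 1, \dots)$ is

$$T\{\{1\}\}(z) = \sum_{n=0}^\infty z^n = \frac{1}{1-z}, \qquad |z| < 1;$$

differentiating, we get

$$T\{\{n+1\}\}(z) = \sum_{n=0}^\infty (n+1) z^n = \sum_{n=1}^\infty n z^{n-1} = D\Big(\frac{1}{1-z}\Big) = \frac{1}{(1-z)^2},$$

while, integrating,

$$T\Big\{\Big\{\frac{1}{n+1}\Big\}\Big\}(z) = \sum_{n=0}^\infty \frac{z^n}{n+1} = -\frac{\log(1-z)}{z}, \qquad |z| < 1.$$

7.57 Example. Moreover, if $T\{a\}(z)$ is the generating function of $a = \{a_n\}$, then

$$\frac{T\{a\}(z)}{1-z} = \sum_{n=0}^\infty z^n \sum_{n=0}^\infty a_n z^n = \sum_{n=0}^\infty \Big(\sum_{k=0}^n a_k\Big) z^n = T\{\alpha\}(z),$$

where $\alpha_n := \sum_{k=0}^n a_k$. For this reason $1/(1-z)$ is often called the summing operator.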
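
On coefficient lists the summing operator is just a Cauchy product with the sequence $(1, 1, 1, \dots)$. The following Python sketch (the helper name `cauchy_product` and the sample data are illustrative) shows that the product reproduces the partial sums.

```python
def cauchy_product(a, b):
    """Coefficients of the product of two power series, truncated to the shorter length."""
    n = min(len(a), len(b))
    return [sum(a[k] * b[m - k] for k in range(m + 1)) for m in range(n)]


a = [3, 1, 4, 1, 5, 9]
ones = [1] * len(a)                  # coefficients of 1/(1 - z)
partial_sums = cauchy_product(a, ones)
print(partial_sums)                  # [3, 4, 8, 9, 14, 23]
```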



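7.58. Let $a = \{a_n\}$ be a sequence that grows at most exponentially fast.
We saw that $T$ is injective, hence it will be possible in principle to
reconstruct $\{a_n\}$ from the generating function $T\{a\}(z)$. Although general
formulas are available in the context of the theory of functions of
complex variables, it is worth noticing that an explicit formula follows from
the Hermite decomposition formula when $T\{a\}(z)$ is a rational function.
Suppose that

$$\sum_{n=0}^\infty a_n z^n = \frac{A(z)}{B(z)}$$

in a disc around zero, $A(z)$ and $B(z)$ being coprime polynomials with
$\deg A < \deg B$; the roots of $B$ are then different from zero, and by Theorem 5.31

$$\frac{A(z)}{B(z)} = \sum_{\alpha \text{ root of } B}\ \sum_{j=1}^{k_\alpha} \frac{A_{\alpha,j}}{(z - \alpha)^j}, \tag{7.24}$$

where $k_\alpha$ is the multiplicity of the root $\alpha$ and the $A_{\alpha,j}$ are constants. Since

$$\frac{1}{(z-\alpha)^j} = \frac{(-1)^j}{\alpha^j}\Big(1 - \frac{z}{\alpha}\Big)^{-j} = \frac{(-1)^j}{\alpha^j} \sum_{n=0}^\infty \binom{-j}{n}\Big(-\frac{z}{\alpha}\Big)^n = \frac{(-1)^j}{\alpha^j} \sum_{n=0}^\infty \binom{n+j-1}{n} \frac{z^n}{\alpha^n}$$

on a disc around zero, (7.24) and the principle of identity of power series
yield

$$a_n = \sum_{\alpha \text{ root of } B}\ \sum_{j=1}^{k_\alpha} \frac{(-1)^j A_{\alpha,j}}{\alpha^{n+j}} \binom{n+j-1}{n} \qquad \forall n. \tag{7.25}$$

When all the roots of $B$ are simple, we have $k_\alpha = 1$ and $A_{\alpha,1} = A(\alpha)/B'(\alpha)$.
Therefore (7.25) simplifies to

$$a_n = -\sum_{\alpha \text{ root of } B} \frac{A(\alpha)}{B'(\alpha)\, \alpha^{n+1}}. \tag{7.26}$$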
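
As an illustration of the simple-root case, take $A(z) = z$ and $B(z) = 1 - z - z^2$; this choice is ours and not from the text. Then $A/B$ is the generating function of the Fibonacci numbers, and the formula $a_n = -\sum_\alpha A(\alpha)/(B'(\alpha)\alpha^{n+1})$ should reproduce them. A minimal Python sketch in floating point arithmetic, with illustrative names:

```python
import math


def coefficient_from_simple_roots(A, dB, roots, n):
    """a_n = - sum over the simple roots r of B of A(r) / (B'(r) * r**(n+1))."""
    return -sum(A(r) / (dB(r) * r ** (n + 1)) for r in roots)


def A(z):
    return z


def dB(z):
    # derivative of B(z) = 1 - z - z^2
    return -1 - 2 * z


s5 = math.sqrt(5)
roots = [(-1 + s5) / 2, (-1 - s5) / 2]   # the two roots of 1 - z - z^2


def fibonacci(n):
    """Fibonacci numbers by the usual recurrence, F_0 = 0, F_1 = 1."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a


print([round(coefficient_from_simple_roots(A, dB, roots, n)) for n in range(10)])
```

Rounding is needed only because the roots are irrational; the formula itself is exact.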



b. Enumerators 

Generating functions are particularly useful in combinatorics. In this case 
it is customary to change slightly the terminology. 

7.59 Definition. The generating function of a sequence $\{a_n\}$ of combinatorial
numbers is called the enumerator of $\{a_n\}$.

7.60 Combinations. The enumerator of the combinations, i.e., of nonordered
samples without replacement, in a population of $n$ distinct elements,

$$C^n_k := \begin{cases} \binom{n}{k} & \text{if } k \le n, \\ 0 & \text{if } k > n, \end{cases}$$

is $(1 + x)^n$, since by Newton's binomial

$$\sum_{k=0}^\infty C^n_k x^k = \sum_{k=0}^n \binom{n}{k} x^k = (1 + x)^n.$$

7.61 Combinations with repetitions. The enumerator of combinations
with repetition, or nonordered samples with replacement, from a
population of $n$ distinct elements, $\binom{n+k-1}{k}$, $k \ge 0$, is $(1 - x)^{-n}$. In
fact (see Example 7.40),

$$(1 - x)^{-n} = \sum_{k=0}^\infty \binom{-n}{k} (-x)^k = \sum_{k=0}^\infty \binom{n+k-1}{k} x^k.$$

Enumerators are truly useful since one can code easily several selection 
rules and constraints. Let us start with some examples. 



7.62 Example. From three distinct objects a, b, c, there are three ways to sample one
object without replacement, namely a, b or c, three ways to sample two objects without
replacement, ab, ac, bc, and only one way to choose three objects, namely abc.

By considering the polynomial $(1 + ax)(1 + bx)(1 + cx)$ and observing that

$$(1 + ax)(1 + bx)(1 + cx) = 1 + (a + b + c)x + (ab + bc + ac)x^2 + (abc)x^3$$

we see that, replacing "or" by $+$ and "and" by $\cdot$, the coefficients of the polynomial
enumerate the simultaneous selections of 0, 1, 2, 3 objects.

7.63 Example. The parallelism in the previous example is not accidental. Given the four
objects a, a, b, c there are three ways of sampling one object (a, b and c), four ways of
choosing two objects (aa, ab, ac, bc), three ways of choosing three objects (aab, aac, abc)
and only one way of choosing four objects (aabc). If we encode the population a, a, b,
c by the polynomial

$$P(x) := (1 + ax + a^2x^2)(1 + bx)(1 + cx)
= 1 + (a + b + c)x + (a^2 + ab + ac + bc)x^2 + (a^2b + a^2c + abc)x^3 + a^2bc\,x^4,$$

we see that the coefficient of $x^r$ enumerates the selections of $r$ objects. In particular
there are four ways to select two objects, and three ways to select three objects.

7.64 Example. The mechanism is even more general, as we can include constraints on
the allowed selections. Still with the population a, a, b, c, we see that, if we want to
count the selections which contain b, it suffices to consider the polynomial

$$(1 + ax + a^2x^2)\, bx\, (1 + cx) = bx + (ab + bc)x^2 + (a^2b + abc)x^3 + a^2bc\,x^4$$

to enumerate the possible selections which contain b.
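
If only the number of selections matters, one can set $a = b = c = 1$ and multiply the resulting polynomials in $x$. The Python sketch below does this for the population a, a, b, c, with and without the constraint that b be selected; polynomials are represented as coefficient lists, and the helper names are illustrative.

```python
def poly_mul(p, q):
    """Product of two polynomials given as coefficient lists (lowest degree first)."""
    r = [0] * (len(p) + len(q) - 1)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            r[i + j] += pi * qj
    return r


def enumerator(factors):
    """Product of the per-element polynomials."""
    result = [1]
    for f in factors:
        result = poly_mul(result, f)
    return result


# Population a, a, b, c: a may appear 0, 1 or 2 times, b and c at most once.
free = enumerator([[1, 1, 1], [1, 1], [1, 1]])
# Same population, but b must appear exactly once: its factor is x.
with_b = enumerator([[1, 1, 1], [0, 1], [1, 1]])

print(free)     # [1, 3, 4, 3, 1]
print(with_b)   # [0, 1, 2, 2, 1]
```

The coefficient of $x^r$ in each list is the number of admissible selections of $r$ objects.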

It is therefore conceivable that we can code the population and the 
constraints on the element to be selected in a polynomial and leave the job 
of enumerating the selections to the algebra of polynomials. The previous 
examples actually suggest how to construct the enumerating polynomial. 

Consider a population of $N$ distinct elements $a_1, a_2, \dots, a_N$ but each
with multiplicity possibly infinite. For each of the $a_i$'s consider the power
series

$$S_i(x) := \sum_{n=0}^\infty \alpha_{i,n} x^n,$$

where

$$\alpha_{i,n} := \begin{cases} 1 & \text{if } a_i \text{ may appear } n \text{ times in the selection,} \\ 0 & \text{if } a_i \text{ is not allowed to appear } n \text{ times in the selection.} \end{cases}$$

The product of these series, one for each distinguishable element of the
population, all converging in $]-1, 1[$, $S_1(x) S_2(x) \cdots S_N(x)$, is the
enumerator of the drawing.



7.65 Example. In the case of combinations without repetitions of a population of
$n$ distinct elements, that is of unordered samples without repetitions, each element
may appear at most once. Thus its enumerator is $(1 + x)$ and the enumerator of the
combinations without repetitions is

$$\underbrace{(1 + x)(1 + x) \cdots (1 + x)}_{n \text{ times}} = (1 + x)^n.$$



7.66 Example. In the case of sampling with replacement, the population has $n$ distinct
elements, but each can occur with arbitrary multiplicity. The enumerator of each
element is then

$$1 + x + x^2 + \cdots = \frac{1}{1 - x}$$

and the enumerator of unordered samples with repetitions is

$$\Big(\sum_{k=0}^\infty x^k\Big) \cdots \Big(\sum_{k=0}^\infty x^k\Big) = \Big(\sum_{k=0}^\infty x^k\Big)^n = \Big(\frac{1}{1 - x}\Big)^n.$$



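7.67 ¶. Show that $\sum_{r=0}^n (-1)^r \binom{n}{r} = 0$, i.e., the number of ways of choosing an even or an odd
number of objects is equal, and equal to $2^{n-1}$.

7.68 ¶. Show that $\sum_{k=0}^n \binom{n}{k}^2 = \binom{2n}{n}$. [Hint: $(1 + x)^n (1 + x)^n = (1 + x)^{2n}$.]

7.69 ¶. Prove Vandermonde's formula using the identity $(1 + x)^n (1 + x)^m = (1 + x)^{n+m}$.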



c. Exponential enumerators 



For sequences $\{a_n\}$, and especially for sequences which grow faster than
exponentially, it is worth computing the enumerator of a rescaled sequence.
For instance the exponential enumerator of the sequence $\{a_n\}$ is the sum
of the power series

$$\sum_{n=0}^\infty a_n \frac{x^n}{n!}.$$

7.70 Arrangements without repetitions. The enumerator of the ordered
samples, or arrangements, without repetitions of $n$ distinct objects,

$$D^n_k := \begin{cases} \dfrac{n!}{(n-k)!} & \text{if } k \le n, \\ 0 & \text{if } k > n, \end{cases}$$

is

$$\sum_{k=0}^\infty D^n_k x^k = 1 + \frac{n!}{(n-1)!}\, x + \frac{n!}{(n-2)!}\, x^2 + \cdots + n!\, x^n,$$

which unfortunately has no simple closed form. However the exponential
enumerator of the same sequence is

$$\sum_{k=0}^\infty D^n_k \frac{x^k}{k!} = \sum_{k=0}^n \binom{n}{k} x^k = (1 + x)^n.$$

7.71 Arrangements with repetitions. The exponential enumerator of
the ordered $k$-samples with repetitions of $n$ distinct objects,
$n^k$, is

$$\sum_{k=0}^\infty n^k \frac{x^k}{k!} = e^{nx}.$$







As for nonordered samples, one can build easily from the population
and the rules of selection the exponential enumerator for ordered samples,
leaving the computation to the algebra of the power series.

Suppose the population is made up of $N$ distinct elements, $a_1, a_2, \dots,
a_N$, each with infinite multiplicity. For each $a_i$, consider the power series

$$S_i(x) := \sum_{n=0}^\infty \alpha_{i,n} \frac{x^n}{n!},$$

where

$$\alpha_{i,n} := \begin{cases} 1 & \text{if } a_i \text{ may appear } n \text{ times in the selection,} \\ 0 & \text{if } a_i \text{ may not appear } n \text{ times in the selection.} \end{cases}$$

Then the exponential enumerator of the ordered samples with repetitions
is $S_1(x) S_2(x) \cdots S_N(x)$.

7.72 Check that the previous rule yields the right result in the case of permutations 
with or without repetitions. 



7.73 f . Show that the exponential enumerator of the permutations of 
o p identical objects is 

o two objects of one type and three of another type is 



d. A few location problems 

The exponential enumerator is particularly useful when locating distinct 
objects into cells. 

7.74 Distributions onto distinct cells and surjective maps. As we
have already stated, locating $k$ different objects in $n$ different cells is
equivalent to fixing a map from $X := \{1, \dots, k\}$ into $Y := \{1, \dots, n\}$. Thus, the
number of ways of placing $k$ distinct objects in $n$ distinct cells with no cell
left empty is equal to the number $S^k_n$ of surjective maps from $X$ onto $Y$.
By Proposition 3.38,

$$S^k_n = \sum_{j=0}^n (-1)^j \binom{n}{j} (n - j)^k.$$

We give an alternate simple proof based on the use of the exponential
enumerator. Since we have $n$ cells and each cell may contain an arbitrary
number of objects, at least 1, the exponential enumerator for each cell is

$$x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots = \sum_{k=1}^\infty \frac{x^k}{k!} = e^x - 1,$$

hence the exponential enumerator of the distribution in $n$ cells is

$$(e^x - 1)^n = \sum_{j=0}^n \binom{n}{j} (-1)^j e^{(n-j)x} = \sum_{j=0}^n \binom{n}{j} (-1)^j \sum_{k=0}^\infty (n-j)^k \frac{x^k}{k!}
= \sum_{k=0}^\infty \Big(\sum_{j=0}^n \binom{n}{j} (-1)^j (n-j)^k\Big) \frac{x^k}{k!}. \tag{7.27}$$

The number of $k$-permutations of the $n$ cells being the coefficient of $x^k/k!$,
we find again the value of $S^k_n$. The numbers

$$S(k, n) := \frac{1}{n!} S^k_n = \frac{1}{n!} \sum_{j=0}^n \binom{n}{j} (-1)^j (n-j)^k$$

are called Stirling numbers of second kind and (7.27) can be rewritten as

$$\frac{(e^x - 1)^n}{n!} = \sum_{k=0}^\infty S(k, n) \frac{x^k}{k!}. \tag{7.28}$$

7.75 Distributions into indistinct cells. Since there are $n!$ ways of
distinguishing $n$ objects, the Stirling number $S(k, n)$ is the number of ways of
placing $k$ distinct objects into $n$ nondistinct cells, all containing at least
one element.

We also saw that there are $n^k$ ways of placing $k$ distinct objects into $n$
distinct cells, when empty cells are allowed. However, the number of ways
of distributing $k$ distinct objects in $n$ nondistinct cells with empty cells
allowed is not $n^k/n!$. It is

$$S(k, 1) + S(k, 2) + \cdots + S(k, n) \qquad \text{for } k \ge n$$

and

$$S(k, 1) + S(k, 2) + \cdots + S(k, k) \qquad \text{for } k < n,$$

i.e., in both cases $\sum_{j=1}^{\min(n,k)} S(k, j)$. In fact, the ways of distributing $k$
objects in $n$ nondistinct cells with empty cells allowed equals the ways of
distributing the $k$ objects so that one cell is not empty, or two cells are
not empty, etc.

If $n \ge k$ the number of distributions of $k$ objects into
$n$ nondistinct cells has a closed form. In fact, since $S(k, n) = 0$ for $n > k$, we
have

$$\sum_{j=0}^k S(k, j) = \sum_{j=0}^\infty S(k, j),$$

consequently

$$\sum_{j=0}^\infty S(k, j) = \sum_{j=0}^\infty \frac{1}{j!} \sum_{n=0}^j \binom{j}{n} (-1)^n (j-n)^k = \sum_{j=0}^\infty \sum_{n=0}^j \frac{(-1)^n}{n!}\, \frac{(j-n)^k}{(j-n)!}
= \Big(\sum_{n=0}^\infty \frac{(-1)^n}{n!}\Big)\Big(\sum_{j=0}^\infty \frac{j^k}{j!}\Big) = \frac{1}{e} \sum_{j=0}^\infty \frac{j^k}{j!}. \tag{7.29}$$

Moreover, from (7.28)

$$\sum_{k=0}^\infty \sum_{j=0}^k S(k, j) \frac{x^k}{k!} = \sum_{k=0}^\infty \sum_{j=0}^\infty S(k, j) \frac{x^k}{k!} = \sum_{j=0}^\infty \sum_{k=0}^\infty S(k, j) \frac{x^k}{k!} = \sum_{j=0}^\infty \frac{(e^x - 1)^j}{j!} = e^{e^x - 1}. \tag{7.30}$$



e. Partitions of a set 

The exponential enumerator is very useful also when dealing with partitioning.
Let $X_k := \{1, 2, 3, \dots, k\}$ be a set with $k$ elements. A partition
of $X_k$ is a decomposition of $X_k$ into a finite union of disjoint subsets
$C_1, C_2, \dots, C_p$. We denote by $P := \{C_1, C_2, \dots, C_p\}$ a partition and by
$\mathcal{P}(X_k)$ the family of partitions of $X_k$. Two partitions $\{C_1, C_2, \dots, C_p\}$ and
$\{D_1, D_2, \dots, D_q\}$ are different if $p \ne q$ or, if $p = q$ and for any permutation
$\sigma$ of the indices we can find $i$ such that $C_i \ne D_{\sigma(i)}$. The number of
partitions of $X_k$, called the $k$-th Bell number, equals the number of
distributions of $k$ distinct objects into $r$ nondistinct cells allowing empty cells, $r \ge k$, hence
by (7.29)

$$|\mathcal{P}(X_k)| = \sum_{j=0}^\infty S(k, j) = \frac{1}{e} \sum_{j=0}^\infty \frac{j^k}{j!}.$$

We can also find such a number by means of the exponential enumerators. 
For that we first state 

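7.76 Proposition. Let $u(x) := \sum_{n=1}^\infty a_n \frac{x^n}{n!}$ be the exponential enumerator
of the sequence $\{a_n\}$, $a_0 = 0$. Then

$$e^{u(x)} = \sum_{k=0}^\infty \frac{x^k}{k!} \sum_{P \in \mathcal{P}(X_k)} \prod_{C \in P} a_{|C|},$$

$|C|$ being the cardinality of $C$.

Proof. In fact

$$e^{u(x)} = \sum_{j=0}^\infty \frac{u^j(x)}{j!} = \sum_{j=0}^\infty \frac{1}{j!}\Big(\sum_{k=1}^\infty a_k \frac{x^k}{k!}\Big)^j
= \sum_{j=0}^\infty \frac{1}{j!} \sum_{k_1, k_2, \dots, k_j = 1}^\infty a_{k_1} a_{k_2} \cdots a_{k_j} \frac{x^{k_1 + k_2 + \cdots + k_j}}{k_1!\, k_2! \cdots k_j!} = 1 + \sum_{r=1}^\infty A_r \frac{x^r}{r!},$$

where

$$A_r := \sum_{j=1}^r\ \sum_{k_1 + k_2 + \cdots + k_j = r} a_{k_1} a_{k_2} \cdots a_{k_j}\, \frac{r!}{j!\, k_1!\, k_2! \cdots k_j!}. \tag{7.31}$$

We may interpret $k_1, k_2, \dots, k_j$ as the cardinalities of a partition $P =
\{C_1, C_2, \dots, C_j\}$ of $\{1, 2, \dots, r\}$ and let $\mathcal{P}_{j,k_1,\dots,k_j}$ be the set of partitions
with $j$ subsets of cardinality $k_1, \dots, k_j$. Since the number of partitions of
$\{1, 2, \dots, r\}$ in $j$ subsets, $C_1, C_2, \dots, C_j$, with cardinality $k_1, \dots, k_j$ is

$$\frac{r!}{k_1!\, k_2! \cdots k_j!}$$

(see, for example, 3.51), and each partition into $j$ subsets is counted $j!$ times when the subsets are listed in order, we conclude from (7.31) that

$$A_r = \sum_{j=1}^r\ \sum_{\substack{P \in \mathcal{P}(X_r) \\ |P| = j}}\ \prod_{C \in P} a_{|C|} = \sum_{P \in \mathcal{P}(X_r)}\ \prod_{C \in P} a_{|C|}.$$

□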



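To compute the number of partitions of $X_n$, we now choose $a_k = 1$,
$k \ge 1$, hence $u(x) = \sum_{k=1}^\infty a_k x^k/k! = e^x - 1$ and Proposition 7.76 yields

$$e^{e^x - 1} = \sum_{n=0}^\infty |\mathcal{P}(X_n)|\, \frac{x^n}{n!}.$$

On the other hand by (7.30)

$$e^{e^x - 1} = \sum_{k=0}^\infty \sum_{j=0}^k S(k, j) \frac{x^k}{k!},$$

therefore $|\mathcal{P}(X_n)| = \sum_{j=0}^n S(n, j)$, and, by (7.29), $|\mathcal{P}(X_n)| = \frac{1}{e} \sum_{j=0}^\infty \frac{j^n}{j!}$.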
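
The Bell numbers can be read off the power series of $e^{e^x - 1}$. The following Python sketch (an illustration with exact rational arithmetic, not the text's method of proof) expands $\sum_j (e^x - 1)^j/j!$ as a truncated power series; since $(e^x - 1)^j$ starts at order $x^j$, powers up to $j = N$ suffice for the coefficients up to $x^N$. All names are illustrative.

```python
from fractions import Fraction
from math import factorial


def bell_numbers(N):
    """Bell numbers B_0..B_N from the series of exp(e^x - 1), truncated at x^N."""
    # coefficients of u(x) = e^x - 1 up to degree N
    u = [Fraction(0)] + [Fraction(1, factorial(k)) for k in range(1, N + 1)]

    def mul(p, q):
        """Product of two truncated series, keeping degrees 0..N."""
        r = [Fraction(0)] * (N + 1)
        for i, pi in enumerate(p):
            if pi:
                for j, qj in enumerate(q[: N + 1 - i]):
                    r[i + j] += pi * qj
        return r

    total = [Fraction(0)] * (N + 1)
    power = [Fraction(1)] + [Fraction(0)] * N   # u^0
    for j in range(N + 1):
        for k in range(N + 1):
            total[k] += power[k] / factorial(j)
        power = mul(power, u)
    # the coefficient of x^k/k! is the k-th Bell number
    return [int(total[k] * factorial(k)) for k in range(N + 1)]


print(bell_numbers(6))   # [1, 1, 2, 5, 15, 52, 203]
```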



7.77 ¶¶. If in Proposition 7.76 we choose $a_k$ = # of trees with $k$ vertices and $u(x)$ is
the corresponding exponential enumerator, then the coefficients $A_k$ in

$$e^{u(x)} = \sum_{k=0}^\infty A_k \frac{x^k}{k!}$$

represent the forests of trees with $k$ vertices.

7.78 Partition of integers. In how many different ways can we decompose
an integer as a sum of integers? For example the number 4 can be
decomposed as 4, 3 + 1, 2 + 2, 2 + 1 + 1, 1 + 1 + 1 + 1. This was one of the
problems discussed by Leonhard Euler (1707-1783) in terms of generating
functions. Clearly, a partition of $n$ is equivalent to a way of distributing $n$
nondistinct objects into $n$ nondistinct cells with empty cells allowed. It is not difficult to realize that
the generating function of the sequence $\{p(n) \mid p(n) := \#\text{ partitions of } n\}$
is given by

$$\sum_{k=0}^\infty p(k) x^k = (1 + x + x^2 + x^3 + \cdots)(1 + x^2 + x^4 + \cdots)(1 + x^3 + x^6 + \cdots) \cdots = \prod_{k=1}^\infty \frac{1}{1 - x^k}. \tag{7.32}$$

Similarly, observing that

$$1 + x + x^2 + x^3 + \cdots = \frac{1}{1 - x} = (1 + x)(1 + x^2)(1 + x^4) \cdots (1 + x^{2^k}) \cdots,$$

we infer that any integer can be expressed as the sum of a selection of
nonnegative integral powers of 2 (without repetitions) exactly in one way,
i.e., every integer written in decimal form can be represented uniquely as a binary alignment.
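
The product formula for the generating function of $p(n)$ translates directly into a computation: multiplying a truncated series by $1/(1 - x^k)$ amounts to adding to each coefficient the coefficient $k$ places earlier. A minimal Python sketch, with an illustrative function name:

```python
def partition_numbers(N):
    """Coefficients p(0..N) of prod_{k>=1} 1/(1 - x^k), truncated at x^N."""
    p = [1] + [0] * N
    for k in range(1, N + 1):
        # multiply in place by 1/(1 - x^k) = 1 + x^k + x^(2k) + ...
        for n in range(k, N + 1):
            p[n] += p[n - k]
    return p


print(partition_numbers(10))   # [1, 1, 2, 3, 5, 7, 11, 15, 22, 30, 42]
```

In particular $p(4) = 5$, in agreement with the five decompositions of 4 listed above.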



Ut non-finitam Seriem finita coercet, 

Summula, & in nullo limite limes adest: 

Sic modico immensi vestigia Numinis haerent 
Corpore, & angusto limite limes abest. 

Cernere in immenso parvum, dic, quanta voluptas!

In parvo immensum cernere, quanta, Deum! 

Jacob Bernoulli^ 



^ Even as the finite encloses an infinite series 
And in the unlimited limits appear, 

So the soul of immensity dwells in minutia 
And in narrowest limits inhere. 

What joy to discern the minute in infinity! 

The vast to perceive in the small, what divinity! 

From Jacob Bernoulli, Tractatus de Seriebus infinitis Earumque Summa Finita et 
Usu in Quadraturis Spatiorum & Rectificationibus Curvarum, in Ars Conjectandi 
(Translation by Helen M. Walker, from A Source Book in Mathematics by D. E. 
Smith, 1929).

7.4 Further Applications

In this section we illustrate some applications, which go back to Euler and
Johann Bernoulli (1667-1748) and are quite relevant in several contexts.

7.4.1 Euler-MacLaurin summation formula

In this section we illustrate a general method of approximating sums found
by Euler and later rediscovered by Colin MacLaurin (1698-1746). As a
consequence we find the asymptotic development of the factorial $n!$ and of
the partial sums of the harmonic series, $H_n := \sum_{k=1}^n 1/k$.



a. Bernoulli numbers 

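As we shall see, the Taylor series of the function

$$g(z) := \begin{cases} \dfrac{z}{e^z - 1}, & z \ne 0, \\ 1, & z = 0 \end{cases}$$

with center in the origin plays an important role. From the theory of
complex functions one infers that $g$ has a Taylor expansion with radius of
convergence $2\pi$, so we can write for $|z| < 2\pi$,

$$\frac{z}{e^z - 1} = \sum_{j=0}^\infty B_j \frac{z^j}{j!}. \tag{7.33}$$

The numbers $\{B_j\}$ are called the Bernoulli numbers. From

$$0 = (e^z - 1) \sum_{j=0}^\infty B_j \frac{z^j}{j!} - z = \sum_{n=0}^\infty \sum_{j=0}^n \frac{B_j}{j!}\, \frac{z^{n+1-j}}{(n+1-j)!} - z = (B_0 - 1) z + \sum_{n=1}^\infty \Big(\sum_{j=0}^n \binom{n+1}{j} B_j\Big) \frac{z^{n+1}}{(n+1)!},$$

we see that they are characterized by the implicit recurrence relation

$$B_0 = 1, \qquad \sum_{j=0}^n \binom{n+1}{j} B_j = 0, \quad n \ge 1, \tag{7.34}$$

from which we can easily compute a few values of $B_n$:

$$B_0 = 1, \quad B_1 = -\frac{1}{2}, \quad B_2 = \frac{1}{6}, \quad B_3 = 0, \quad B_4 = -\frac{1}{30}, \quad B_5 = 0.$$

7.79. Convergence and equality in (7.33), and other consequences, can be inferred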
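starting from Euler's formula for $\cot z$, compare (6.30),

$$z \cot z = 1 - 2 \sum_{k=1}^\infty \frac{z^2}{k^2\pi^2 - z^2}, \qquad |z| < \pi. \tag{7.35}$$

On account of the Weierstrass double series theorem, we infer that

$$z \cot z = 1 - 2 \sum_{k=1}^\infty \frac{z^2}{k^2\pi^2}\, \frac{1}{1 - \frac{z^2}{k^2\pi^2}} = 1 - 2 \sum_{k=1}^\infty \sum_{j=0}^\infty \Big(\frac{z^2}{k^2\pi^2}\Big)^{j+1} = 1 - 2 \sum_{j=1}^\infty \alpha_{2j} z^{2j}, \tag{7.36}$$

where

$$\alpha_{2j} := \frac{1}{\pi^{2j}} \sum_{k=1}^\infty \frac{1}{k^{2j}}.$$

The equality (7.36) holds on $|z| < \pi$. On the other hand

$$\frac{z}{e^z - 1} + \frac{z}{2} = \frac{z}{2}\, \frac{e^z + 1}{e^z - 1} = \frac{z}{2}\, \frac{\cosh(z/2)}{\sinh(z/2)} = \frac{z}{2} \coth\Big(\frac{z}{2}\Big) = w \cot w, \tag{7.37}$$

where $2iw := z$. Equating (7.36) and (7.37), we then infer

$$\frac{z}{e^z - 1} = 1 - \frac{z}{2} - 2 \sum_{j=1}^\infty \alpha_{2j} w^{2j} = 1 - \frac{z}{2} - 2 \sum_{j=1}^\infty \alpha_{2j} \frac{(-1)^j}{4^j} z^{2j} \tag{7.38}$$

near zero, hence $z/(e^z - 1)$ has a Taylor expansion centered at zero,

$$\frac{z}{e^z - 1} = \sum_{j=0}^\infty B_j \frac{z^j}{j!},$$

where $B_0 = 1$, $B_1 = -1/2$, and, for $j \ge 1$, $B_{2j+1} = 0$ and

$$\frac{B_{2j}}{(2j)!} = \frac{(-1)^{j+1}}{2^{2j-1}\pi^{2j}} \sum_{k=1}^\infty \frac{1}{k^{2j}}. \tag{7.39}$$

In particular

$$\frac{|B_{2n}|}{(2n)!} = \frac{2}{(2\pi)^{2n}} \sum_{k=1}^\infty \frac{1}{k^{2n}}, \tag{7.40}$$

from which we infer that

$$S(z) := \sum_{j=0}^\infty B_j \frac{z^j}{j!}$$

has radius of convergence at least $2\pi$. Since $S(z)$ and $z/(e^z - 1)$ have both complex
derivatives on $|z| < 2\pi$, we conclude that

$$\frac{z}{e^z - 1} = \sum_{j=0}^\infty B_j \frac{z^j}{j!}, \qquad |z| < 2\pi.$$

Notice that (7.40) yields, since $(-1)^{j+1} B_{2j} = |B_{2j}|$ by (7.39),

$$\sum_{k=1}^\infty \frac{1}{k^{2j}} = (-1)^{j+1}\, \frac{(2\pi)^{2j} B_{2j}}{2\,(2j)!}, \qquad j \ge 1;$$

in particular,

$$\sum_{k=1}^\infty \frac{1}{k^2} = \frac{\pi^2}{6}, \qquad \sum_{k=1}^\infty \frac{1}{k^4} = \frac{\pi^4}{90}.$$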

b. Bernoulli polynomials 

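Bernoulli polynomials are defined by

$$B_n(x) := \sum_{k=0}^n \binom{n}{k} B_k x^{n-k}, \qquad x \in \mathbb{R}. \tag{7.41}$$

It is not difficult to show that the exponential enumerator of $\{B_n(x)\}$ is
$te^{xt}/(e^t - 1)$, i.e.,

$$\frac{t e^{xt}}{e^t - 1} = \sum_{k=0}^\infty B_k(x) \frac{t^k}{k!}, \qquad |t| < 2\pi. \tag{7.42}$$

They satisfy the relations

$$B_n(x + 1) - B_n(x) = n x^{n-1}, \tag{7.43}$$

$$D B_n(x) = n B_{n-1}(x). \tag{7.44}$$

In particular

$$B_0(x) = B_0 = 1, \qquad B_n(1) = B_n(0) = B_n \quad \forall n \ge 2, \qquad \int_0^1 B_n(x)\,dx = 0 \quad \forall n \ge 1. \tag{7.45}$$

To prove (7.43) we compute

$$\sum_{n=0}^\infty \big(B_n(x + 1) - B_n(x)\big) \frac{t^n}{n!} = \frac{t e^{(x+1)t}}{e^t - 1} - \frac{t e^{xt}}{e^t - 1} = t e^{xt} = \sum_{n=1}^\infty x^{n-1} \frac{t^n}{(n-1)!};$$

then it remains to equate the coefficients of $t^n$ for all $n$. To prove (7.44) it
suffices to differentiate (7.41), while the rest is then trivial.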
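
The difference relation $B_n(x+1) - B_n(x) = n x^{n-1}$ can be checked exactly with rational arithmetic. The sketch below recomputes the Bernoulli numbers from their recurrence and evaluates $B_n(x)$ from the definition; the names are illustrative and the sample point is arbitrary.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(N):
    """B_0..B_N from B_0 = 1 and sum_{j=0}^{n} C(n+1, j) B_j = 0 for n >= 1."""
    B = [Fraction(1)]
    for n in range(1, N + 1):
        B.append(-sum(comb(n + 1, j) * B[j] for j in range(n)) / (n + 1))
    return B


def bernoulli_poly(n, x, B):
    """B_n(x) = sum_{k=0}^{n} C(n, k) B_k x^(n-k)."""
    return sum(comb(n, k) * B[k] * x ** (n - k) for k in range(n + 1))


B = bernoulli_numbers(8)
x = Fraction(3, 7)
for n in range(1, 8):
    lhs = bernoulli_poly(n, x + 1, B) - bernoulli_poly(n, x, B)
    print(n, lhs == n * x ** (n - 1))
```

Summing the relation over consecutive integers gives power sums, for instance $1^3 + \cdots + 10^3 = (B_4(11) - B_4(1))/4$.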
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Figure 7.3. The normalized Bernoulli polynomials Bn(x)/ Bn, respectively (a) n = 2, 
(b) n = 4 and (c) n=6. 



7.80. From (7.43), we infer

$$\sum_{j=1}^n \big(B_{m+1}(j + 1) - B_{m+1}(j)\big) = (m + 1) \sum_{j=1}^n j^m,$$

that is,

$$\sum_{j=1}^n j^m = \frac{B_{m+1}(n + 1) - B_{m+1}(1)}{m + 1}.$$



7.81. Using the properties of Bernoulli polynomials, in particular (7.44)
and (7.45), one proves inductively on $n$ (and we leave it to the reader) that
the only possible minimum and maximum points for $B_{2m}(x)$, $x \in [0, 1]$, are
0, 1 and 1/2. From

$$\frac{t e^{t/2}}{e^t - 1} = 2\, \frac{t/2}{e^{t/2} - 1} - \frac{t}{e^t - 1}$$

we compute

$$B_{2m}(1/2) = (2^{1-2m} - 1) B_{2m},$$

consequently

$$|B_{2m}(x)| \le |B_{2m}|, \qquad x \in [0, 1]. \tag{7.46}$$

c. Euler-MacLaurin formula and Stirling's approximation

7.82 Theorem. Given nonnegative integers g, m > 1, p < q, and a 
smooth function f, we have 



where 



q-± TTl „ 

k=p “'P fc=i 



+ Rm 



(7.47) 



D'^f{x) dx. 



Proof. We first observe that it suffices to prove the claim in the case $p = 0$ and
$q = 1$, because we can then replace $f$ by $f(x + h)$ for any integer $h$, getting

$$f(h) = \int_h^{h+1} f(x)\,dx + \sum_{k=1}^{m}\frac{B_k}{k!}\,D^{k-1}f(x)\Big|_h^{h+1} - (-1)^m\int_h^{h+1}\frac{B_m(x-[x])}{m!}\,D^m f(x)\,dx;$$

summing in $h$ on the range $p \le h < q$, we then get (7.47), since the intermediate terms
telescope. The proof when $p = 0$ and $q = 1$, i.e., of

$$f(0) = \int_0^1 f(x)\,dx + \sum_{k=1}^{m}\frac{B_k}{k!}\,D^{k-1}f(x)\Big|_0^1 + R_m, \qquad R_m := (-1)^{m+1}\int_0^1 \frac{B_m(x)}{m!}\,D^m f(x)\,dx,$$

is by induction on $m$. For $m = 1$ it amounts to proving

$$f(0) = \int_0^1 f(x)\,dx - \frac12\big(f(1) - f(0)\big) + \int_0^1 (x - 1/2)\,f'(x)\,dx,$$

which is just

$$\frac{f(1) + f(0)}{2} = \int_0^1 D\big((x - 1/2)f(x)\big)\,dx = \int_0^1 f(x)\,dx + \int_0^1 (x - 1/2)\,f'(x)\,dx.$$

To pass from $m - 1$ to $m$, $m > 1$, we need to show that

$$R_{m-1} = \frac{B_m}{m!}\,D^{m-1}f(x)\Big|_0^1 + R_m,$$

which reduces to

$$(-1)^m B_m\,D^{m-1}f(x)\Big|_0^1 = m\int_0^1 B_{m-1}(x)\,D^{m-1}f(x)\,dx + \int_0^1 B_m(x)\,D^m f(x)\,dx.$$

As previously, taking into account (7.44) and integrating by parts, we see that this holds if
and only if

$$(-1)^m B_m\,D^{m-1}f(x)\Big|_0^1 = B_m(x)\,D^{m-1}f(x)\Big|_0^1,$$

i.e., if and only if

$$(-1)^m B_m = B_m(1) = B_m(0) \qquad \forall m > 1,$$

which we know to hold since $B_m(1) = B_m(0) = B_m$, and $B_m = 0$ for $m$ odd. □
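On account of (7.46) and (7.40), we can easily evaluate the remainder
and rewrite the Euler-MacLaurin formula as

$$\sum_{k=p}^{q-1} f(k) = \int_p^q f(x)\,dx - \frac12\big(f(q) - f(p)\big) + \sum_{k=1}^{m}\frac{B_{2k}}{(2k)!}\,D^{2k-1}f(x)\Big|_p^q + R_{2m}, \tag{7.48}$$

with

$$|R_{2m}| \le \frac{4}{(2\pi)^{2m}}\int_p^q |D^{2m}f(x)|\,dx. \tag{7.49}$$

If $D^{2m}f(x) \ge 0$ in $[p,q]$, then $D^{2m-1}f(x)$ is increasing and the integral
$\int_p^q |D^{2m}f(x)|\,dx$ equals $D^{2m-1}f(x)\big|_p^q$; therefore, using (7.46), we can
estimate the remainder by

$$|R_{2m}| \le \int_p^q \frac{|B_{2m}(x-[x])|}{(2m)!}\,|D^{2m}f(x)|\,dx \le \frac{\|B_{2m}(x)\|_{\infty,[0,1]}}{(2m)!}\int_p^q D^{2m}f(x)\,dx \le \frac{|B_{2m}|}{(2m)!}\,D^{2m-1}f(x)\Big|_p^q. \tag{7.50}$$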

7.83. An interesting application of the Euler-MacLaurin formula is to the study of
the asymptotic development of $\sum_{k=1}^{n-1} f(k)$ when $n \to \infty$. The structure of the Euler-
MacLaurin formula is

$$\sum_{k=1}^{n-1} f(k) = F(n) - F(1) + \sum_{k=1}^{m}\big(T_k(n) - T_k(1)\big) + R_m(n)$$

where

$$R_m(n) := (-1)^{m+1}\int_1^n \frac{B_m(x-[x])}{m!}\,D^m f(x)\,dx,$$

etc. (here $F$ is a primitive of $f$ and $T_k(x) := \frac{B_k}{k!}D^{k-1}f(x)$). Assuming $D^m f(x) = O(x^{c-m})$ as $x \to \infty$ for large $m$, it is not difficult to see that
$R_m(n)$ is not small for large $n$, but has only a small tail, i.e.,

$$R_m(n) = R_m(\infty) + R'_m(n), \qquad R'_m(n) := (-1)^{m}\int_n^{\infty}\frac{B_m(x-[x])}{m!}\,D^m f(x)\,dx = O(n^{c+1-m}).$$

Therefore we can conclude that for a suitable constant $C$ we have

$$\sum_{k=1}^{n-1} f(k) = F(n) + C + \sum_{k=1}^{m} T_k(n) + R'_m(n).$$


7.84 Example (Harmonic series). We apply 7.83 to $f(x) := 1/x$. Since

$$D^m f(x) = (-1)^m\,\frac{m!}{x^{m+1}},$$

we deduce

$$\sum_{k=1}^{n-1}\frac{1}{k} = \log n + C - \frac{1}{2n} - \sum_{k=1}^{m}\frac{B_{2k}}{2k\,n^{2k}} + R'_{2m}(n)$$

for some constant $C$. Since $D^{2m}f(x) > 0$ $\forall m$, we can estimate the rest by

$$|R'_{2m}(n)| \le \frac{|B_{2m+2}|}{(2m+2)\,n^{2m+2}};$$

adding $1/n$ and observing that $C$ is the Euler-Mascheroni constant $\gamma$, see Example 6.26,
we conclude

$$\sum_{k=1}^{n}\frac{1}{k} = \log n + \gamma + \frac{1}{2n} - \sum_{k=1}^{m}\frac{B_{2k}}{2k\,n^{2k}} - \theta_{m,n}\,\frac{B_{2m+2}}{(2m+2)\,n^{2m+2}}$$

for some $\theta_{m,n}$ with $|\theta_{m,n}| \le 1$.
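The expansion of the harmonic sum and the size of its remainder can be observed numerically. The sketch below hard-codes the first even Bernoulli numbers and a floating-point value of the Euler-Mascheroni constant (both are inputs assumed here, not computed from the text), and compares the truncated expansion with the direct sum for $n = 10$.

```python
from fractions import Fraction
from math import log

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant, hard-coded
B_EVEN = {2: Fraction(1, 6), 4: Fraction(-1, 30),
          6: Fraction(1, 42), 8: Fraction(-1, 30)}

def harmonic_expansion(n, m):
    """log n + gamma + 1/(2n) - sum_{k=1}^{m} B_{2k} / (2k n^(2k))."""
    s = log(n) + EULER_GAMMA + 1 / (2 * n)
    for k in range(1, m + 1):
        s -= float(B_EVEN[2 * k]) / (2 * k * n ** (2 * k))
    return s

n = 10
exact = sum(1 / k for k in range(1, n + 1))
for m in range(4):
    err = abs(exact - harmonic_expansion(n, m))
    bound = float(abs(B_EVEN[2 * m + 2])) / ((2 * m + 2) * n ** (2 * m + 2))
    print(m, err, bound)  # the error stays below the first omitted term
```

Already for $n = 10$ each additional Bernoulli term gains roughly two to three decimal digits, which is why the expansion is useful even though the series in $k$ does not converge.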



7.85 Example (Stirling approximation). Similarly to Example 7.84, on account of
Stirling's formula in Example 2.67, we can state

$$\log n! = n\log n - n + \frac12\log n + \log\sqrt{2\pi} + \sum_{k=1}^{m}\frac{B_{2k}}{2k(2k-1)\,n^{2k-1}} + \theta_{m,n}\,\frac{B_{2m+2}}{(2m+2)(2m+1)\,n^{2m+1}}$$

and $|\theta_{m,n}| \le 1$.
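As a sanity check of the expansion of $\log n!$, one can compare it with `math.lgamma(n + 1)`. This is a sketch under the assumption that the first even Bernoulli numbers are supplied by hand; the function name `log_factorial_stirling` is illustrative.

```python
from fractions import Fraction
from math import lgamma, log, pi

B_EVEN = {2: Fraction(1, 6), 4: Fraction(-1, 30),
          6: Fraction(1, 42), 8: Fraction(-1, 30)}

def log_factorial_stirling(n, m):
    """Leading Stirling terms plus m Bernoulli corrections for log n!."""
    s = n * log(n) - n + 0.5 * log(n) + 0.5 * log(2 * pi)
    for k in range(1, m + 1):
        s += float(B_EVEN[2 * k]) / (2 * k * (2 * k - 1) * n ** (2 * k - 1))
    return s

n = 10
exact = lgamma(n + 1)  # log(10!)
for m in range(4):
    err = abs(exact - log_factorial_stirling(n, m))
    bound = float(abs(B_EVEN[2 * m + 2])) / ((2 * m + 2) * (2 * m + 1) * n ** (2 * m + 1))
    print(m, err, bound)  # error versus the size of the first omitted term
```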



7.4.2 Euler Γ function

The gamma function, $\Gamma(x)$, defined by Euler in 1729, is surely one of
the most important special functions, as it unexpectedly appears in many
topics in analysis.

a. Definition and characterizations

For $0 < x < \infty$, $\Gamma(x)$ is defined as

$$\Gamma(x) := \int_0^{\infty} t^{x-1}e^{-t}\,dt.$$

7.86 Proposition. $\Gamma(x) \in \mathbb{R}_+$ and, for all $x \in\, ]0,\infty[$,

o $\Gamma(x+1) = x\,\Gamma(x)$,
o $\Gamma(1) = 1$, $\Gamma(n+1) = n!$,
o $\log\Gamma(x)$, $x \in\, ]0,\infty[$, is convex on $]0,\infty[$; in particular $\Gamma$ is a continuous
function.

Proof. (i) follows easily integrating by parts. Clearly $\Gamma(1) = 1$, thus (ii)
follows from (i) by induction. Applying Hölder's inequality (see [GM1]),
one easily obtains

$$\Gamma\Big(\frac{x}{p} + \frac{y}{q}\Big) \le \Gamma(x)^{1/p}\,\Gamma(y)^{1/q}, \qquad \frac1p + \frac1q = 1,$$

that is equivalent to (iii). □
Actually these three properties characterize $\Gamma(x)$ completely.

7.87 Theorem. Let $f : ]0,\infty[\, \to\, ]0,\infty[$ be a function such that
o $f(x+1) = x f(x)$,
o $f(1) = 1$,
o $\log f$ is convex.

Then $f(x) = \Gamma(x)$ $\forall x > 0$.

Proof. It suffices to show that (i), (ii), (iii) uniquely determine $f(x)$ for all $x > 0$ and
actually, because of (i), for all $x \in\, ]0,1[$. Set $\varphi := \log f$. Then

$$\varphi(x+1) = \varphi(x) + \log x, \qquad \varphi(1) = 0 \quad\text{and } \varphi \text{ is convex.} \tag{7.51}$$

By induction we see that $\varphi(n+1) = \log n!$ for all integers $n \ge 1$. Since $\varphi$ is convex (see,
e.g., [GM1]), we have for $0 < x < 1$,

$$\log n = \frac{\varphi(n+1) - \varphi(n)}{1} \le \frac{\varphi(n+1+x) - \varphi(n+1)}{x} \le \frac{\varphi(n+2) - \varphi(n+1)}{1} = \log(n+1), \tag{7.52}$$

while iterating the first of (7.51),

$$\varphi(n+1+x) = \varphi(x) + \log[x(x+1)\cdots(x+n)].$$

Multiplying (7.52) by $x$ and subtracting $x\log n$, we then get

$$0 \le \varphi(x) - \log\frac{n!\,n^x}{x(x+1)\cdots(x+n)} \le x\log\Big(1 + \frac1n\Big).$$

Since the last term tends to zero as $n \to \infty$, $\varphi(x)$ is uniquely determined. □



7.88 Gauss's formula. In the proof of Theorem 7.87 we have in fact
proved that

$$\Gamma(x) = \lim_{n\to\infty}\frac{n!\,n^x}{x(x+1)\cdots(x+n)},$$

or, equivalently

$$\lim_{n\to\infty}\frac{\Gamma(x+n)}{n^x\,\Gamma(n)} = 1.$$

Actually one can prove that the previous formula characterizes the
gamma function. We have the following.



7.89 Theorem. Let $F : ]0,\infty[\, \to\, ]0,\infty[$ be a function such that

(i) $F(x+1) = xF(x)$,

(ii) $F(1) = 1$,

(iii) $\lim_{n\to\infty} \dfrac{F(x+n)}{n^x F(n)} = 1$.

Then $F(x) = \Gamma(x)$.

Proof. In fact,

$$F(n) = (n-1)! \quad\text{and}\quad F(x+n) = x(x+1)\cdots(x+n-1)\,F(x),$$

therefore (iii) and Gauss's formula yield

$$1 = \lim_{n\to\infty}\frac{F(x+n)}{n^x F(n)} = \lim_{n\to\infty}\frac{x(x+1)\cdots(x+n-1)\,F(x)}{n!\,n^{x-1}} = \frac{F(x)}{\Gamma(x)}. \qquad □$$
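Gauss's formula in 7.88 converges slowly, and the inequality obtained in the proof of Theorem 7.87 even brackets the error for $0 < x < 1$: the ratio $\Gamma(x)/G_n(x)$, with $G_n(x) := n!\,n^x/(x(x+1)\cdots(x+n))$, lies between $1$ and $(1+1/n)^x$. A minimal numerical sketch follows; the name `gamma_gauss` is hypothetical, and the product is evaluated through logarithms only to avoid overflow.

```python
import math

def gamma_gauss(x, n):
    """n-th approximant n! n^x / (x (x+1) ... (x+n)) of Gauss's formula."""
    log_value = (math.lgamma(n + 1) + x * math.log(n)
                 - math.fsum(math.log(x + k) for k in range(n + 1)))
    return math.exp(log_value)

x = 0.5
for n in (10, 100, 1000, 10000):
    ratio = math.gamma(x) / gamma_gauss(x, n)
    # ratio should lie between 1 and (1 + 1/n)^x
    print(n, ratio, (1 + 1 / n) ** x)
```

The printed ratios approach 1 at rate $O(1/n)$, which is why the product formula is of theoretical rather than computational interest.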



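We can also express $\Gamma(x)$ as an infinite product: this is the original
definition of Euler.

7.90 Proposition. We have

$$\frac{1}{\Gamma(x)} = x\,e^{\gamma x}\prod_{n=1}^{\infty}\Big(1 + \frac{x}{n}\Big)e^{-x/n},$$

where $\gamma$ is the Euler-Mascheroni constant.

Proof. Write $g(x)$ for the reciprocal of the right-hand side. Taking the logarithm
we see that $g(1) = 1$ and

$$\frac{1}{g(x)} = \lim_{n\to\infty}\exp\Big(x\Big(\sum_{j=1}^{n}\frac1j - \log n\Big)\Big)\,x\prod_{j=1}^{n}\Big(1 + \frac{x}{j}\Big)e^{-x/j} = \lim_{n\to\infty}\exp(-x\log n)\,x\Big(1 + \frac{x}{1}\Big)\Big(1 + \frac{x}{2}\Big)\cdots\Big(1 + \frac{x}{n}\Big) = \lim_{n\to\infty}\frac{x(x+1)\cdots(x+n)}{n!\,n^x},$$

i.e.,

$$g(x) = \lim_{n\to\infty}\frac{n!\,n^x}{x(x+1)\cdots(x+n)},$$

which is $\Gamma(x)$ by Gauss's formula. □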



b. Functional relations

7.91 Beta function. The function

$$B(x,y) := \int_0^1 t^{x-1}(1-t)^{y-1}\,dt, \qquad x, y > 0,$$

is called the beta function. It is related to the gamma function by the following.

Proposition. We have

$$B(x,y) = \frac{\Gamma(x)\,\Gamma(y)}{\Gamma(x+y)}. \tag{7.53}$$

In particular, since $\Gamma(n+1) = n!$,

$$B(m+1, n+1) = \frac{1}{(m+n+1)\binom{m+n}{m}}$$

for all $n, m \in \mathbb{N}$.

Proof. Set

$$f(x) := \frac{\Gamma(x+y)\,B(x,y)}{\Gamma(y)}.$$

We have

$$f(1) = \frac{\Gamma(1+y)\,B(1,y)}{\Gamma(y)} = y\,B(1,y) = 1$$

since $B(1,y) = 1/y$. Moreover $\log f$ is convex, since $x \to \log\Gamma(x+y)$ and $x \to \log B(x,y)$ are convex: the latter can be proved as
in Proposition 7.86. Finally,

$$f(x+1) = \frac{\Gamma(x+y+1)\,B(x+1,y)}{\Gamma(y)} = (x+y)\,\frac{\Gamma(x+y)}{\Gamma(y)}\,B(x+1,y) = x f(x)$$

since $B(x+1,y) = \frac{x}{x+y}\,B(x,y)$, as it is easily seen by performing an integration by
parts. Theorem 7.87 then yields $f(x) = \Gamma(x)$. □



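7.92 $\Gamma(1/2) = \sqrt{\pi}$. The substitution $t = \sin^2\theta$ in the definition of the
beta function turns (7.53) into

$$\frac{\Gamma(x)\,\Gamma(y)}{\Gamma(x+y)} = 2\int_0^{\pi/2}(\sin\theta)^{2x-1}(\cos\theta)^{2y-1}\,d\theta.$$

This for $x = y = 1/2$ gives

$$\Gamma(1/2) = \sqrt{\pi}. \tag{7.54}$$

7.93 $\int_{-\infty}^{\infty} e^{-x^2}\,dx = \sqrt{\pi}$. The substitution $t = s^2$ in the definition of $\Gamma$
yields

$$\Gamma(x) = 2\int_0^{\infty} s^{2x-1}e^{-s^2}\,ds.$$

The special case $x = 1/2$ then gives the important

$$\int_{-\infty}^{\infty} e^{-s^2}\,ds = 2\int_0^{\infty} e^{-s^2}\,ds = \sqrt{\pi}.$$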
7.94 Duplication formula of Legendre. It is not difficult to verify
that Theorem 7.89 applies to the function

$$F(x) := \frac{2^{x-1}}{\sqrt{\pi}}\,\Gamma\Big(\frac{x}{2}\Big)\,\Gamma\Big(\frac{x+1}{2}\Big);$$

this yields the so-called duplication formula of Legendre

$$\Gamma(x)\,\Gamma\Big(x + \frac12\Big) = 2^{1-2x}\sqrt{\pi}\,\Gamma(2x).$$



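7.95 Formula of complementary arguments. Performing the change
of variable $t = s/(1+s)$, we get

$$B(x,y) = \int_0^1 t^{x-1}(1-t)^{y-1}\,dt = \int_0^{+\infty}\frac{s^{x-1}}{(1+s)^{x+y}}\,ds.$$

In particular if $0 < x < 1$,

$$\Gamma(x)\,\Gamma(1-x) = B(x, 1-x) = \int_0^{\infty}\frac{s^{x-1}}{1+s}\,ds.$$

If $x = \frac{2m+1}{2n}$, $n > m$, performing another change of variable, $s = t^{2n}$, we get

$$\int_0^{\infty}\frac{s^{x-1}}{1+s}\,ds = 2n\int_0^{\infty}\frac{t^{2m}}{1+t^{2n}}\,dt = \frac{\pi}{\sin\big(\frac{2m+1}{2n}\pi\big)},$$

see Example 5.36. Therefore,

$$\Gamma(x)\,\Gamma(1-x) = B(x, 1-x) = \frac{\pi}{\sin(\pi x)}$$

if $x = (2m+1)/(2n)$, $n > m$. Since $\{(2m+1)/2n \mid n > m\}$ is dense in
$]0,1[$, and $\Gamma$ is continuous, we conclude

$$\Gamma(x)\,\Gamma(1-x) = B(x, 1-x) = \frac{\pi}{\sin(\pi x)}$$

also for any $x \in\, ]0,1[$.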
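The functional relations of this subsection are easy to test numerically with the standard library's `math.gamma`. The sketch below checks the formula of complementary arguments, Legendre's duplication formula, the value $\Gamma(1/2) = \sqrt{\pi}$, and the integer case of (7.53); the helper names are illustrative.

```python
import math

def beta(x, y):
    """Beta function expressed through the gamma function, as in (7.53)."""
    return math.gamma(x) * math.gamma(y) / math.gamma(x + y)

def reflection_sides(x):
    """Both sides of Gamma(x) Gamma(1 - x) = pi / sin(pi x), 0 < x < 1."""
    return math.gamma(x) * math.gamma(1 - x), math.pi / math.sin(math.pi * x)

def duplication_sides(x):
    """Both sides of Gamma(x) Gamma(x + 1/2) = 2^(1-2x) sqrt(pi) Gamma(2x)."""
    lhs = math.gamma(x) * math.gamma(x + 0.5)
    rhs = 2 ** (1 - 2 * x) * math.sqrt(math.pi) * math.gamma(2 * x)
    return lhs, rhs

for x in (0.1, 0.25, 0.5, 0.9):
    print(x, reflection_sides(x), duplication_sides(x))
print(math.gamma(0.5) ** 2, math.pi)
print(beta(3, 4), 1 / (6 * math.comb(5, 2)))  # B(m+1, n+1) with m = 2, n = 3
```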



7.96. Taking logarithms on both sides of the product formula of Proposition 7.90 we get

$$\log\Gamma(x) = -\log x - \gamma x - \sum_{n=1}^{\infty}\Big(\log\Big(1 + \frac{x}{n}\Big) - \frac{x}{n}\Big). \tag{7.55}$$

Expanding the logarithms occurring in the infinite series we get

$$\log\Gamma(x) = -\log x - \gamma x + \sum_{n=1}^{\infty}\sum_{j=2}^{\infty}\frac{(-1)^j}{j}\,\frac{x^j}{n^j}$$

and hence, by Weierstrass's double series theorem and the relation
$\Gamma(x+1) = x\,\Gamma(x)$,

$$\log\Gamma(x+1) = -\gamma x + \sum_{m=2}^{\infty}\frac{(-1)^m}{m}\,\zeta(m)\,x^m$$

with $\zeta(m) := \sum_{n=1}^{\infty} 1/n^m$.

7.97 The function $\psi$. The Gaussian psi function or digamma is defined
as the logarithmic derivative of the gamma function

$$\psi(x) := D\log\Gamma(x) = \frac{\Gamma'(x)}{\Gamma(x)}.$$

From (7.55) we obtain

$$\psi(x) = -\gamma - \frac{1}{x} - \sum_{k=1}^{\infty}\Big(\frac{1}{x+k} - \frac{1}{k}\Big),$$

the series being absolutely and uniformly convergent in any bounded closed
interval of $]0,\infty[$. In particular we have

$$\psi(1) = -\gamma$$

since $\sum_{k=1}^{\infty}\big(1/(1+k) - 1/k\big) = -1$. By logarithmic differentiation we can
easily translate relations of the $\Gamma$ function into relations for $\psi$. For
instance, we have

$$\psi(x+1) - \psi(x) = \frac{1}{x}$$

and therefore for any $n \in \mathbb{N}$, $n \ge 1$,

$$\psi(x+n) - \psi(x) = \sum_{k=1}^{n}\frac{1}{x+k-1};$$

also

$$\psi(x) - \psi(1-x) = -\pi\cot\pi x,$$

$$2\log 2 + \psi(x) + \psi\Big(x + \frac12\Big) = 2\psi(2x),$$

in particular

$$\psi(1/2) = -\gamma - 2\log 2,$$

$$\psi(n+1) = -\gamma + \sum_{k=1}^{n}\frac{1}{k},$$

$$\psi(n+1/2) = -\gamma - 2\log 2 + 2\sum_{k=1}^{n}\frac{1}{2k-1}.$$
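The series for $\psi$ obtained from (7.55) can be summed directly, although it converges only like $x/N$ after $N$ terms. The sketch below, with a hard-coded value of the Euler-Mascheroni constant (an input assumed here) and the hypothetical name `digamma_series`, reproduces $\psi(1) = -\gamma$, $\psi(1/2) = -\gamma - 2\log 2$ and $\psi(x+1) - \psi(x) = 1/x$ to about five decimal places.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant, hard-coded

def digamma_series(x, terms=200_000):
    """psi(x) = -gamma - 1/x - sum_{k>=1} (1/(x+k) - 1/k), truncated.

    Each summand 1/(x+k) - 1/k equals -x / (k (x+k)); this form avoids
    cancellation between two nearly equal numbers.
    """
    partial = math.fsum(x / (k * (x + k)) for k in range(1, terms + 1))
    return -EULER_GAMMA - 1.0 / x + partial

print(digamma_series(1.0), -EULER_GAMMA)
print(digamma_series(0.5), -EULER_GAMMA - 2 * math.log(2))
print(digamma_series(3.7) - digamma_series(2.7), 1 / 2.7)
```

The truncation error is of order `x / terms`, so the agreement improves only linearly with the number of terms; the asymptotic expansions derived later in the section are far more efficient for large arguments.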
7.98 An integral representation of $\psi$. We conclude this section by
giving an integral representation of $\psi$. We have

$$\psi(x) + \gamma = -\frac{1}{x} - \sum_{k=1}^{\infty}\Big(\frac{1}{x+k} - \frac{1}{k}\Big) = \sum_{k=0}^{\infty}\Big(\frac{1}{k+1} - \frac{1}{x+k}\Big) = \sum_{k=0}^{\infty}\int_0^{\infty}\big(e^{-(k+1)t} - e^{-(x+k)t}\big)\,dt.$$

Reversing the order of summation and integration and using the formula
for the sum of a geometric series we then can easily conclude

$$\psi(x) + \gamma = \int_0^{\infty}\frac{e^{-t} - e^{-xt}}{1 - e^{-t}}\,dt.$$

7.99 An integral representation of $\psi'$. Finally, the formula for $\psi'$ is
very useful. It is obtained by differentiating under the integral sign

$$\psi'(x) = \int_0^{\infty}\frac{t\,e^{-xt}}{1 - e^{-t}}\,dt. \tag{7.56}$$

Actually, in the above, reversing the order of summation and integration
and differentiating under the integral sign require some justification. One
can provide an ad hoc justification, but we prefer not to do so, since it becomes
much simpler in the context of Lebesgue integration.



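c. Asymptotics of Γ and ψ

Suppose that $f(t) : [0,\infty[\, \to \mathbb{R}$ is a smooth function which has, together
with its derivatives, at most a polynomial growth near infinity. Then

$$\int_0^{\infty} f(t)\,e^{-xt}\,dt, \qquad x > 0,$$

is well defined. We are interested in its asymptotic expansion near infinity.
Integrating by parts $n$ times we infer

$$\int_0^{\infty} f(t)\,e^{-xt}\,dt = \sum_{k=0}^{n-1}\frac{D^k f(0)}{x^{k+1}} + r_n(x), \qquad r_n(x) := \frac{1}{x^n}\int_0^{\infty} D^n f(t)\,e^{-xt}\,dt.$$

If we also assume that

$$\int_0^{\infty} |D^n f(t)|\,dt < +\infty,$$

we readily conclude

$$\int_0^{\infty} f(t)\,e^{-xt}\,dt = \sum_{k=0}^{n-1}\frac{D^k f(0)}{x^{k+1}} + O\Big(\frac{1}{x^n}\Big) \qquad\text{as } x \to \infty.$$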

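The previous remarks apply, as it is easily verified, to the derivative of the
$\psi$ function, see (7.56); as $D^k\big(t/(1 - e^{-t})\big)(0) = B_k$ for $k \ne 1$, while the value for $k = 1$ is $1/2$, we then conclude

$$\psi'(x) = \frac{1}{x} + \frac{1}{2x^2} + \sum_{k=1}^{n-1}\frac{B_{2k}}{x^{2k+1}} + O\Big(\frac{1}{x^{2n+1}}\Big).$$

Integrating twice, we obtain

$$\psi(x) = A + \log x - \frac{1}{2x} - \sum_{k=1}^{n-1}\frac{B_{2k}}{2k\,x^{2k}} + O\Big(\frac{1}{x^{2n}}\Big), \tag{7.57}$$

$$\log\Gamma(x) = B + (A-1)x + x\log x - \frac12\log x + \sum_{k=1}^{n-1}\frac{B_{2k}}{2k(2k-1)\,x^{2k-1}} + O\Big(\frac{1}{x^{2n-1}}\Big) \tag{7.58}$$

as $x \to \infty$, where $A$ and $B$ are two constants. In particular we have

$$\log\Gamma(x) = B + (A-1)x + (x - 1/2)\log x + O\Big(\frac{1}{x}\Big).$$

From the relation $\log\Gamma(x+1) - \log\Gamma(x) - \log x = 0$ it follows that

$$A - 1 + \Big(x + \frac12\Big)\log(x+1) - \Big(x + \frac12\Big)\log x = O\Big(\frac{1}{x}\Big) \qquad\text{as } x \to \infty,$$

which implies $A = 0$. Similarly from the duplication formula of Legendre
we infer that $B = (\log 2\pi)/2$. Therefore we conclude with the asymptotic
representations for $\Gamma$ and $\psi$: for all $n \in \mathbb{N}$, $n \ge 1$, we have, when $x \to \infty$,

$$\psi(x) = \log x - \frac{1}{2x} - \sum_{k=1}^{n-1}\frac{B_{2k}}{2k\,x^{2k}} + O\Big(\frac{1}{x^{2n}}\Big), \tag{7.59}$$

$$\log\Gamma(x) = x\log x - x - \log\sqrt{x} + \log\sqrt{2\pi} + \sum_{k=1}^{n-1}\frac{B_{2k}}{2k(2k-1)\,x^{2k-1}} + O\Big(\frac{1}{x^{2n-1}}\Big); \tag{7.60}$$

in particular

$$\Gamma(x+1) = \sqrt{2\pi x}\,\Big(\frac{x}{e}\Big)^{x}\Big(1 + O\Big(\frac{1}{x}\Big)\Big)$$

and, for $x = n$, Stirling's formula

$$n! = \sqrt{2\pi n}\,\Big(\frac{n}{e}\Big)^{n}\Big(1 + O\Big(\frac{1}{n}\Big)\Big).$$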
7.5 Summing Up

Convergence and continuity of the sum

o The radius of convergence $\rho$ of a power series $\sum_{n=0}^{\infty} a_n z^n$ is defined by

$$\frac{1}{\rho} := \limsup_{n\to\infty}\sqrt[n]{|a_n|},$$

where we have adopted the conventions $1/\infty = 0$, $1/0^+ = \infty$. If $\rho > 0$, equivalently,
if the sequence $\{|a_n|\}$ grows at most exponentially, then $\sum_{n=0}^{\infty} a_n z^n$ converges absolutely
in the interior of the disc of convergence $\{z \in \mathbb{C} \mid |z| < \rho\}$ and does not converge
if $|z| > \rho$. Therefore the domain $A \subset \mathbb{C}$ in which $\sum_{n=0}^{\infty} a_n z^n$ converges, that is the
domain in which the sum $S(z) = \sum_{n=0}^{\infty} a_n z^n$ exists as a complex number, is the disc
of convergence together with possibly part of or the whole boundary $\{z \in \mathbb{C} \mid |z| = \rho\}$.

o Power series converge uniformly on any disc $\{z \in \mathbb{C} \mid |z| \le r\}$, $\forall r < \rho$. This means
that for any $r < \rho$ the error we get substituting the sum with a partial sum,

$$\Big|\sum_{n=p}^{\infty} a_n z^n\Big|,$$

is bounded by a quantity $\epsilon(p, r)$ that goes to zero as $p \to \infty$ and is independent of $z$
provided $|z| \le r$. This is equivalent to saying that

$$M(p, r) := \sup_{|z| \le r}\Big|\sum_{n=p}^{\infty} a_n z^n\Big| \to 0 \qquad\text{as } p \to \infty.$$

Uniform convergence on all discs strictly included in $|z| < \rho$ is less than the uniform
convergence on $\{z \mid |z| < \rho\}$. However, it suffices to prove that
o the sum $S(z) := \sum_{n=0}^{\infty} a_n z^n$ is continuous on the interior of the disc of convergence.
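The difference between uniform convergence on every smaller disc and uniform convergence on the whole open disc is already visible for the geometric series $\sum z^n$, whose radius of convergence is 1. Its tail is $\sum_{n\ge p} z^n = z^p/(1-z)$, so the quantity $M(p, r)$ above equals $r^p/(1-r)$, attained at $z = r$. The sketch below tabulates it; the function name is illustrative.

```python
def geometric_tail_sup(p, r):
    """M(p, r) = sup over |z| <= r of |sum_{n>=p} z^n| = r^p / (1 - r), 0 <= r < 1."""
    return r ** p / (1.0 - r)

for r in (0.5, 0.9, 0.99, 0.999):
    row = [geometric_tail_sup(p, r) for p in (10, 50, 200)]
    print(r, row)
```

For each fixed $r < 1$ the row tends to zero as $p$ grows, but for fixed $p$ the values blow up as $r \to 1$: no single $p$ makes the tail small on the whole open disc.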



Differentiation and integration of power series

Let $\sum_{n=0}^{\infty} a_n z^n$ be a complex power series with a positive radius of convergence $\rho > 0$,
and sum $S(z)$. Then

o $S(z)$ has a complex derivative on $\{z \in \mathbb{C} \mid |z| < \rho\}$ and $DS(z) = \sum_{n=1}^{\infty} n\,a_n z^{n-1}$,
that is, the sum of a power series can be differentiated term by term in the interior of
the disc of convergence.

o Actually $S(z)$ has complex derivatives of any order on $\{z \mid |z| < \rho\}$ and

$$D^k S(z) = \sum_{n=k}^{\infty} n(n-1)\cdots(n-k+1)\,a_n z^{n-k}.$$

o We have $a_k = D^k S(0)/k!$ $\forall k$, that is, each power series with a positive radius of
convergence is the Taylor series of its sum.
o The sum of a power series can be integrated term by term in the interior of the disc
of convergence: if $\rho > 0$ denotes the radius of the disc of convergence of $\sum_{n=0}^{\infty} a_n z^n$,
then, if $|z| < \rho$, we have

$$\int_0^z S(z)\,dz = \sum_{n=0}^{\infty} a_n\,\frac{z^{n+1}}{n+1}.$$

The symbol $\int_0^z$ is the classical oriented definite integral in case $z \in \mathbb{R}$, while it is
suitably defined when $z \in \mathbb{C}$, see Section 7.1.3.
Boundary values

Let $\sum_{n=0}^{\infty} a_n z^n$ be a complex power series with a positive radius of convergence $\rho > 0$,
and sum $S(z)$. Two cases can occur: either the series converges absolutely at a point
$z_0$ with $|z_0| = \rho$, or the series possibly converges, but not absolutely, at some points
with $|z| = \rho$. In the first case, which implies the uniform convergence of the series
and the continuity of the sum on the closed disc $\{|z| \le \rho\}$, the convergence test of
Proposition 7.29 may be useful. In the second case, some information at the boundary
is provided by Dirichlet's and Abel's theorems, Theorems 7.30 and 7.31. A consequence
of Abel's theorem is that $A_1 \subset A_2 \subset A_3$ if $A_1$, $A_2$ and $A_3$ are respectively the domains
of convergence of $\sum n\,a_n z^{n-1}$, $\sum a_n z^n$ and $\sum a_n \frac{z^{n+1}}{n+1}$.



7.6 Exercises 



7.100 f . Show that 



00 , 



n=0 
00 



n=0 



z^ -\-z 



l-z2’ 



1-Z2’ 

n=0 ^ 

(-1)" 1 

p. ’ 



n=0 



(-1)^ 

2n 



7.101 K. Compute the sums of the following series 



cos(l + 2n) 






n=U 

00 ^ 2 n+l 



■^C ,xn Sin(l + 2n) 

ho ( 2 ")! ’ 

^ -3n+2 
^ z2n+l 

• ^ 71 . 






]E log(cos(i/2'')). 



[Hint: For the last series, show that

$$\prod_{k=1}^{n}\cos\Big(\frac{x}{2^k}\Big) = \frac{\sin x}{2^n\sin(x/2^n)}.]$$
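$$e^z = \sum_{n=0}^{\infty}\frac{z^n}{n!}, \qquad z \in \mathbb{C},$$

$$\log(1+z) = \sum_{n=0}^{\infty}(-1)^n\,\frac{z^{n+1}}{n+1}, \qquad |z| \le 1,\ z \ne -1,$$

$$\sin z = \sum_{n=0}^{\infty}(-1)^n\,\frac{z^{2n+1}}{(2n+1)!}, \qquad z \in \mathbb{C},$$

$$\sinh z = \sum_{n=0}^{\infty}\frac{z^{2n+1}}{(2n+1)!}, \qquad z \in \mathbb{C},$$

$$\cos z = \sum_{n=0}^{\infty}(-1)^n\,\frac{z^{2n}}{(2n)!}, \qquad z \in \mathbb{C},$$

$$\arcsin z = \sum_{n=0}^{\infty}\frac{(2n-1)!!}{(2n)!!}\,\frac{z^{2n+1}}{2n+1}, \qquad |z| \le 1,$$

$$\arccos z = \frac{\pi}{2} - \arcsin z, \qquad |z| \le 1,$$

$$\arctan z = \sum_{n=0}^{\infty}(-1)^n\,\frac{z^{2n+1}}{2n+1}, \qquad |z| \le 1,\ z \ne \pm i,$$

$$\tanh^{-1} z = \sum_{n=0}^{\infty}\frac{z^{2n+1}}{2n+1}, \qquad |z| \le 1,\ z \ne \pm 1,$$

$$\frac{1}{1-z} = \sum_{n=0}^{\infty} z^n, \qquad |z| < 1,$$

$$(1+z)^{\alpha} = \sum_{n=0}^{\infty}\binom{\alpha}{n} z^n, \qquad |z| < 1,$$

where

$$(2n)!! = \begin{cases} 1 & \text{if } n = 0, \\ 2n(2n-2)\cdots 4\cdot 2 & \text{if } n \ge 1, \end{cases} \qquad (2n+1)!! = (2n+1)(2n-1)\cdots 5\cdot 3\cdot 1,$$

and for $\alpha \in \mathbb{R}$

$$\binom{\alpha}{n} = \frac{\alpha(\alpha-1)(\alpha-2)\cdots(\alpha-n+1)}{n!}.$$

Figure 7.4. A table of Taylor series of some elementary functions.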
7.102 ¶. Let $f(x) := |\sin x|/x$, $x > 0$, and for $n = 0, 1, \ldots$, let

$$f_n(x) := \begin{cases} f(x) & \text{if } n\pi \le x < (n+1)\pi, \\ 0 & \text{otherwise.} \end{cases}$$

Show that

(i) $\sum_{n=0}^{\infty} f_n(x) = f(x)$ $\forall x > 0$.

(ii) $\sup_{x>0}\big|\sum_{k=n+1}^{\infty} f_k(x)\big| \to 0$.

(iii) $\sum_{n=0}^{\infty}\sup_{x>0}|f_n(x)| = +\infty$.

7.103 Show that + (— l)”+^/n) converges uniformly in [0,1/2], but it 

does not converge absolutely. 

7.104 f. Show that x + — {k + converges, the limits 

of the sum are the sum of the limits, but it is not uniformly convergent. 

7.105 Show the following 

Proposition. Suppose converges uniformly on \z\ = 1. Then ^nZ^ 

converges uniformly on \z\ < 1. 

[Hint: Reread the proof of Abel’s theorem.] 

7.106 f1. Let f{z) = ^nZ^ on \z\ < r. Then f{z) is representable as power 

series ^n{z — zq)'^ with center any zq with \zq\ < r and domain of convergence 

that contains {z\\z — zq\ < r}. [Hint: Set z = zq + h and p := \zq\ — \h\, and write 

oo oo ^ JU 

k=0 k=0 t=o ^ 



7.107 Composition of power series. Suppose that S(x) = 

'^(y) •= 12^=0 are two power series with T(0) = 0 and, respectively, with positive 
radii of convergence p(<S') and p{T). Show that the composition 5 o T is the sum of a 
power series with positive radius of convergence. More precisely show that, if r > 0 is 
such that ^ then the radius of convergence of 5 o T is at least r. 

7.108 f ^ Inverse of a power series. Let S{x) = be a power series 

with 5(0) 0 and positive radius of convergence. Show that there is a power series 

T with radius of convergence 1 such that S{x)T{x) — 1. 

7.109 f % Reciprocal power series. Let S{x) = be a power series with 

5(0) = 0, 5'(0) ^ 0 and positive radius of convergence. Show that there is a power series 
T with T(0) = 0 and positive radius of convergence such that S{T{x)) = x [Hint: see 
e.g., Cartan, Theorie elementaire des functions analytiques d’une ou plusieurs variables 
complexes, Hermann, Paris, 1961.] 

7.110 f Let / : N R be a function such that /(I) ^ 0, f{xiX2) = f(xi)f{x2) 

Wxi,X 2 and Show that 

(i) /(I) + E ?^1 \f{n)\’^ < oo. n^i(l - f{p)) < oo, n^i < oo and finally 

OO 

= E /'‘(P) = 1 + ■'■(P) + + • • • • 

1 fc=0 
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(ii) If {pn} is the sequence of primes and 

1 1 1 



Pn := 









then 



Pr,=J2f(N), 

N 

the sum being taken on all naturals N which decompose in prime factors con- 
taining only the primes pi,P 2 , • • • ^Pn- 
(hi) Euler’s formula 



E/w= n 



n = l p prime 

(iv) If /(n) = 1/n^, with s > 1, then 



/(p) 



oo ^ 

E-= IT — ^ 

Z—/ i i 1 — p 



n=l p prime ^ 

the product being taken on all primes. The function 



cx) ^ 

«•) - E 



is actually well defined for all s G C with > 1 and is called Riemann’s C- 
function. 

7.111 IMf Euler. Let {pi} be the sequence of primes. Then ^ = +oo. [Hint: If 

^/Pi < ^5 then for some m € N we have J2i>m ^/Pi < t/2. Setting a := Olii Pi^ 
Pi divides 1 + not at most for i > m, hence 



oo ^ oo ^ . oo 

1 + na ^ V ^ Pi / 2^ 



= 1 i>m 

7.112 Find the series solutions of the differential equations 

y" - xy' y = 0, y" - x^y = 0. 

7.113 Find the series solutions of the Cauchy problems 

f y" + {x- l)y’ -{x- l)y = 0, / 

\y(l) = 1, y'(l) = 0, \y(0) = y'(0) = 1. 

7.114 The equation 

y" -\-e^y = 0 

has a solution of the form y{x) = that satisfies p(0) = 1, p'(0) = 0. Find 

some of its first terms. 

7.115 % Legendre’s equation. Find the series solutions of the ODE 

{1 - x‘^)y" -2xy' + Xy = 0 



called Legendre’s polynomials. [Hint: y{x) = uq — ^x"^ — - ^ 

^ 2 , (12-A)(2-A) 5 , . . ^ 1 

3-2 ^ 5-4-3-2 ^ )'\ 



X -h 
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7.116 ^ Hermite polynomials. Find the series solutions of the ODE 

y" - 2xy' + 2 y = 0 , 

called Hermite’s polynomials. 

7.117 f Euler equation. Show that there is no series solution except 0 of the equa- 
tion 

y” -\-axy' -h I3y = 0. 

Show that y{x) = x'^ where 7 satisfies 7(7 — 1) -F 07 -f- ^ = 0, is a solution. 

7.118 ^ Frobenius’s method. Examples 7.53 and 7.54 suggest that the power series 
method may not work if the higher order coefficient of the second order linear ODE 
vanishes at x = 0. In this case we may try solutions of the form 

00 

x'^ a^x^, 7 G M. 

fc =0 

Try the method with the following Bessel’s equation 

x'^y" -\-xy' (x‘^ - ^)y = 0 

4 

and Laguerre’s equation 

xy" + (1 - x)y' + y = 0. 

7.119 f . Let A{x) and E{x) be respectively the enumerator and the exponential enu- 
merator of {un}. Show that 



A{x) = / e ^E(sx)ds. 

Jo 

7.120 f . Let {pn} be a sequence in [0, 1[ and let P{x) be the enumerator of {pn}- The 
k-moment of {pn} is defined by 



rrik := 

3=0 

Assuming that is finite for all /c > 0, show that the enumerator of {rrifc} is 

M{x) = P{e^). 

7.121 Show that the binary numbers of 2 n bits are (^^). 

7.122 Show that . 

7.123 Show that 

1 ^ ^ /m + fc\ fc 



7.124 Show that 

00 

n=0 

where Hn{x) solves y" — 2 x y' 2n y = 0. 



gn(x) ,„ 

n! 
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7.125 Show that the enumerator for the selection of r objects out of n objects, 
r > n, with unlimited repetitions but with each object included in the selection, is 



(£'T-£C:;) 



7.126 11. Show that the number of ways in which r nondistinct objects can be dis- 
tributed in n distinct cells, with the condition that no cell contains less than q nor more 

than q + z — 1 objects, is the coefficient of in the expansion of 



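7.127 ¶¶. Show that a convex polygon of $n + 2$ sides can be divided into triangles by means of
diagonals that do not intersect in

$$c_n := \frac{1}{n+1}\binom{2n}{n}$$

different ways. The numbers $c_n$ are called Catalan
numbers. [Hint: Notice that $c_{n+1} = \sum_{k=0}^{n} c_k c_{n-k}$, hence, if $c(x) = \sum_{n=0}^{\infty} c_n x^n$, we
have $c^2(x) = \sum_{n=0}^{\infty} c_{n+1}x^n$ and $x\,c^2(x) = c(x) - 1$.]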
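The recurrence in the hint and the closed form can be compared directly for small $n$. The following sketch computes the Catalan numbers both ways; the function names are illustrative, and the convention $c_0 = 1$ is assumed.

```python
from math import comb

def catalan_by_recurrence(n_max):
    """c_0 = 1 and c_{n+1} = sum_{k=0}^{n} c_k c_{n-k}, up to index n_max."""
    c = [1]
    for n in range(n_max):
        c.append(sum(c[k] * c[n - k] for k in range(n + 1)))
    return c

def catalan_closed_form(n):
    """c_n = C(2n, n) / (n + 1); the division is exact."""
    return comb(2 * n, n) // (n + 1)

print(catalan_by_recurrence(10))
print([catalan_closed_form(n) for n in range(11)])
```

For instance $c_2 = 2$ corresponds to the two diagonals of a quadrilateral, and $c_3 = 5$ to the five triangulations of a pentagon.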



7.128 11. Show that the exponential enumerator for the distribution of r or less ob- 
jects into n distinct cells, with objects in the same cell ordered, is expx/(l — x). 
Bernhard Bolzano (1781-1848)
Carl Friedrich Gauss (1777-1855)
Siméon Poisson (1781-1840)
Wilhelm Bessel (1784-1846)
Augustin-Louis Cauchy (1789-1857)
George Green (1793-1841)
Mikhail Ostrogradski (1801-1862)
Niels Henrik Abel (1802-1829)
Jean-Charles-François Sturm (1803-1855)
Carl Jacobi (1804-1851)
Lejeune Dirichlet (1805-1859)
William R. Hamilton (1805-1865)
Joseph Liouville (1809-1882)
Karl Weierstrass (1815-1897)
George Gabriel Stokes (1819-1903)
Eduard Heine (1821-1881)
Leopold Kronecker (1823-1891)
Joseph Bertrand (1822-1900)
Charles Hermite (1822-1901)
Enrico Betti (1823-1892)
Richard Dedekind (1831-1916)
Camille Jordan (1838-1922)
Edmond Laguerre (1834-1886)
Gaston Darboux (1842-1917)
Georg Cantor (1845-1918)
Eugenio Beltrami (1835-1899)
Giulio Ascoli (1843-1896)
Hermann Schwarz (1843-1921)
Ulisse Dini (1845-1918)
Cesare Arzelà (1847-1912)
Luigi Bianchi (1856-1928)
J. Henri Poincaré (1854-1912)
David Hilbert (1862-1943)

Figure 7.5. Infinitesimal analysis: a chronology from Gauss to Poincaré and Hilbert.
The laws of classical physics are deterministic: if we know exactly the
state of a system at a given instant, we know its state for all times. Such a
principle, which mathematically corresponds to the existence and uniqueness
theorem for the Cauchy problem, has been (and is) a key idea in
scientific thought. Pierre-Simon Laplace (1749-1827) wrote in his Essai
philosophique sur les probabilités:

Nous devons donc envisager l'état présent de l'Univers comme
l'effet de son état antérieur, et comme cause de celui qui va
suivre. Une intelligence qui pour un instant donné connaîtrait
toutes les forces dont la nature est animée et la situation
respective des êtres qui la composent, si d'ailleurs elle était
assez vaste pour soumettre ses données à l'analyse, embrasserait
dans la même formule les mouvements des plus grands corps
de l'Univers et ceux du plus léger atome : rien ne serait
incertain pour elle, et l'avenir, comme le passé, serait présent à ses
yeux. L'esprit humain offre dans la perfection qu'il a su donner
à l'astronomie une faible esquisse de cette intelligence.^

^ Therefore we have to consider the present state of the universe as the effect of its
previous state and cause of its future state. An intelligence that at a given instant
could know all the forces that animate nature and the respective situations of all
the beings that constitute it, an intelligence that, moreover, could be large enough
to be able to analyze all these data, could contain in the same formula both the
movements of the largest bodies in the universe and of the smallest atom: nothing
would be uncertain for it and the future, as well as the past, would be present to its
eyes. The human spirit offers just a feeble trace of this intelligence in the perfection
it was able to give to astronomy.

But this principle is often contradicted by everyday experience, when
some facts seem to take place unpredictably and at random, as is the case
with meteorology.

From the point of view of predictability, since there will always be a
certain degree of uncertainty on the initial situation, things will be
predictable if two initially close states evolve closely; otherwise one has to
expect chaotic behavior: close states evolve into paths that are very far from
each other. Probably the first to have stated precisely the phenomenon of
sensitive dependence on initial conditions were Jacques Hadamard (1865-1963),
who studied the flow of geodesic lines on a surface, and Pierre Duhem
(1861-1916), who observed how the sensitive dependence on initial conditions
made long term predictions illusory for the systems considered by
Hadamard^. The fact that these systems are no exception seems clear to
J. Henri Poincaré (1854-1912), who writes in Science et méthode

Une cause très petite, qui nous échappe, détermine un effet
considérable que nous ne pouvons pas ne pas voir, et alors nous
disons que cet effet est dû au hasard. Si nous connaissions
exactement les lois de la nature et la situation de l'Univers à
l'instant initial, nous pourrions prédire exactement la situation
de ce même Univers à un instant ultérieur. Mais, lors même que
les lois naturelles n'auraient plus de secret pour nous, nous ne
pourrions connaître la situation initiale qu'approximativement.

Si cela nous permet de prévoir la situation ultérieure avec la
même approximation, c'est tout ce qu'il nous faut, nous
disons que le phénomène a été prévu, qu'il est régi par des lois ;
mais il n'en est pas toujours ainsi, il peut arriver que de
petites différences dans les conditions initiales en engendrent de
très grandes dans les phénomènes finaux ; une petite erreur sur
les premières produirait une erreur énorme sur les derniers. La
prédiction devient impossible et nous avons le phénomène
fortuit.^



^ See La théorie physique, son objet et sa structure, Éditions Chevalier et Rivière,
1906.

^ A very small cause that escapes our attention determines a notable effect that we
cannot fail to see, and in this case we say that it is due to chance. If we knew exactly
the laws of nature and the situation of the universe at the initial moment, we could
predict exactly the situation of the same universe at a succeeding moment, but even
if it were the case that the natural laws had no longer any secret for us, we could
still only know the initial situation approximately. If that enables us to predict the
succeeding situation with the same approximation, that is all we require, and we
should say that the phenomenon has been predicted, that it is governed by laws. But
it is not always so: it may happen that small differences in the initial conditions
produce very great ones in the final phenomena. A small error in the former will
produce an enormous error in the latter. Prediction becomes impossible, and we
have the fortuitous phenomenon.

He also adds

Comment devons-nous nous représenter un récipient rempli de
gaz ? D'innombrables molécules, animées de grandes vitesses,
sillonnent ce récipient dans tous les sens ; à chaque instant elles
choquent les parois, ou bien elles se choquent entre elles ; et ces
chocs ont lieu dans les conditions les plus diverses. Ce qui nous
frappe surtout ici, ce n'est pas la petitesse des causes, c'est leur
complexité. Et cependant, le premier élément se retrouve encore
ici et joue un rôle important. Si une molécule était déviée vers
la gauche ou vers la droite de sa trajectoire, d'une quantité très
petite, comparable au rayon d'action des molécules gazeuses,
elle éviterait un choc, ou elle le subirait dans des conditions
différentes, et cela ferait varier, peut-être de 90° ou de 180°, la
direction de sa vitesse après le choc.

Et ce n'est pas tout, il suffit, nous venons de le voir, de dévier
la molécule avant le choc d'une quantité infiniment petite, pour
qu'elle soit déviée, après le choc, d'une quantité finie.^

^ How should we represent a container full of gas? Innumerable molecules race inside
the container in all directions; at every instant they hit the container's sides or collide
with one another; and all these collisions take place under the most diverse conditions.
What strikes us in this case is not the smallness of the causes but mainly their
complexity. And yet, the first element is still present and plays an important role.
If a molecule were to be diverted to its left or right by a small quantity comparable
to the range of action of a gas molecule, it could avoid a collision or undergo the
collision under different conditions, and this could change its direction by 90 or
maybe 180 degrees. And this is not all, we have just seen that it is sufficient to divert
the molecule by an infinitesimal quantity before the collision in order to divert it,
after the collision, by a finite quantity.

This somehow explains how a deterministic and regular behavior may
generate chaos; but on the other hand it may suggest that a chaotic
behavior may create order, possibly on a different scale from the macroscopic
behavior of the gas.

Chaotic behavior may be generated by sensitive dependence on the
parameters of the system, as well as by sensitive dependence on the initial
data. For instance, the behavior of water coming out of a slightly open
tap is regular, while it gets chaotic if the tap is completely open. In the
same way, the behavior of a fluid between two rotating cylinders is regular
when they rotate slowly and gets more and more chaotic as the rotation speed
increases.

Figure 8.2. J. Henri Poincaré (1854-1912) and the frontispiece of his
Méthodes nouvelles de mécanique céleste.

Poincaré wrote also
Pourquoi les météorologistes ont-ils tant de peine à prédire le
temps avec quelque certitude ? Pourquoi les chutes de pluie, les
tempêtes elles-mêmes nous semblent-elles arriver au hasard, de
sorte que bien des gens trouvent tout naturel de prier pour avoir
de la pluie ou le beau temps, alors qu'ils jugeraient ridicule de
demander une éclipse par une prière ? Nous voyons que les grandes
perturbations se produisent généralement dans les régions où
l'atmosphère est en équilibre instable, qu'un cyclone va naître
quelque part, mais où ? Ils sont hors d'état de le dire ; un dixième
de degré en plus ou en moins en un point quelconque, le cyclone
éclate ici et non pas là, et il étend ses ravages sur des contrées
qu'il aurait épargnées. Si on avait connu ce dixième de degré, on
aurait pu le savoir d'avance, mais les observations n'étaient ni
assez serrées, ni assez précises, et c'est pour cela que tout semble
dû à l'intervention du hasard.^



^ Why do meteorologists have such a hard time in foreseeing the weather with a rea- 
sonable degree of precision? Why do showers and storms seem to occur at random, 
so that many people find it absolutely natural to pray for rain or good weather, while 
they would find praying for an eclipse utterly ridiculous? We see that great pertur- 
bations generally occur in regions where the atmosphere is unstable. Meteorologists 
are well aware of the instability of the equilibrium and that somewhere there will 
be a hurricane, but where? They cannot tell, because a tenth of a degree more or 
less at any point will determine a hurricane here instead of there, and there will be 
devastations in areas that would have been spared. If one had known this tenth of a 
degree one could have foreseen the event, but observations were neither sufficiently 
frequent nor sufficiently precise, and for this reason everything seems to be due to 
the intervention of chance. 
Even though mathematicians have always known that dynamical systems 
may behave in unexpected and complicated ways, it is only with 
the invention of computers and the increasing interest in mathematical 
models for population dynamics, biology, electronic circuits with nonlinear 
components, astronomy and meteorology that the study of deterministic 
chaos has acquired importance, to the point of becoming a fashionable 
subject not only among mathematicians and physicists. Particularly relevant 
in this process are the contributions of Edward Lorenz (1917-2008), 
meteorologist at MIT, who published in 1963 a simplified model of a fluid 
that shows the rapid growth of errors in dependence on initial conditions, 
and the work of the two mathematicians D. Ruelle and F. Takens, who, 
in 1971, conjectured that hydrodynamic turbulence may be represented 
by strange attractors, mathematical objects that describe evolutions with 
sensitive dependence on initial conditions. 

One may have the impression that complex dynamics is typical of dis- 
persive or nonconservative dynamics. But this is not true, as is shown by 
the question of the stability of the solar system. 

It is commonly agreed that in Newton’s opinion gravitational inter- 
actions among planets were so strong that they could compromise the 
stability of the system and that probably for this reason he formulated the 
hypothesis that it was God who controlled these instabilities in order to 
ensure the existence of the solar system; Newton writes in his Principia 

It is not to be conceived that mere mechanical causes could 
give birth to so many regular motions. [...] This most beautiful 
system of the sun, planets, and comets, could only proceed from 
the counsel and dominion of an intelligent and powerful Being. 

During the Age of Enlightenment, Lagrange, Laplace, and Poisson pro- 
vided mathematical reasons in favour of the stability of planetary orbits, 
showing, for instance, absence of polynomial growth in time of the major 
axis of the orbit up to third order with respect to the planetary masses. 
More recently Poincaré and George Birkhoff (1884-1944) showed that in 
the dynamics of planets one may encounter instabilities that make the 
motion in the phase space quite complex, and the more recent contributions 
of A. N. Kolmogorov, V. I. Arnold, and J. Moser suggest the coexistence 
of both stability and instability. All the same, many questions regarding 
the n-body problem are still unresolved.® 

So far we have always referred to continuous dynamical systems. However, 
discrete dynamical systems are naturally associated to continuous 
ones, for instance in the form of a discretization or of a Poincaré map. They 
also occur naturally in the study of population dynamics, are the 
appropriate models for computers, and often show a chaotic behavior even 
when the corresponding continuous system does not. 



® See the paper by Jacques Laskar in A. Dahan Dalmedico, J. L. Chabert, K. Chemla, 
Eds., Chaos et déterminisme, Éditions du Seuil, Paris, 1992, and S. Marmi, Chaotic 
behavior in the solar system, Séminaire Bourbaki, 51ème année, 1998-1999, n. 854. 
In this chapter we shall describe the behavior of a few simple discrete 
dynamical systems with the aim of showing some paths leading to chaos. 
A wider and more precise analysis would require a more detailed study 
of ordinary differential equations and further technical tools. 
In Section 1 we discuss first and second order linear difference equations, 
some nonlinear examples of recurrences, and continued fractions; in 
Section 2 we then illustrate some aspects of one-dimensional dynamical 
systems. 



8.1 Recurrences 

In the previous chapters we encountered on several occasions recursive 
relations, some of which lead to closed form sequences while some do not, 
and we studied in some detail the process of summing with the analysis of 
series. In this section we discuss a few more classical recurrences that, from 
a dynamical point of view, that is, from the point of view of their behavior 
at infinity, are quite regular. 



8.1.1 Linear difference equations 

In this section, we discuss first order linear difference equations. They are 
the discrete version of the first order ODE and can be solved in closed 
form. 



a. First order linear difference equations 

We recall (see Example 2.5) that, given $\{f_n\}$, the recurrence 

$$\begin{cases} x_0 \text{ given},\\ x_{n+1} = x_n + f_{n+1}, \quad \forall n \ge 0,\end{cases}$$

is equivalent to 

$$x_n := x_0 + \sum_{j=1}^{n} f_j.$$

Moreover, we have the following. 

8.1 Proposition. Given $a \in \mathbb{R}$ and $\{f_n\}$, the solution of 

$$\begin{cases} x_0 \text{ given},\\ x_{n+1} = a x_n + f_{n+1}, \quad \forall n \ge 0,\end{cases} \tag{8.1}$$

is 

$$x_n := a^n x_0 + \sum_{j=1}^{n} a^{n-j} f_j. \tag{8.2}$$

In fact, the sequence $\{x_n\}$ given by (8.2) verifies the recurrence relations 
(8.1). The solutions of (8.1) have a structure which is quite similar to the 
structure of the solutions of first order linear ODEs (see, e.g., Chapter 5 of 
[GM1]): 

o the sequence $u = \{u_n\}$, given by $u_n = a^n$, solves 
$$u_0 = 1, \qquad u_{n+1} = a u_n \quad \forall n \ge 0;$$

o if $f := \{f_n\}$, then the product of convolution of $u$ and $f$, $\{u * f\}$, 
$(u * f)_n := \sum_{j=0}^{n} u_{n-j} f_j$, solves 

$$\begin{cases} x_0 = f_0,\\ x_{n+1} = a x_n + f_{n+1}, \quad \forall n \ge 0,\end{cases}$$

since 

$$(u * f)_{n+1} = \sum_{j=0}^{n+1} u_{n+1-j} f_j = a \sum_{j=0}^{n} u_{n-j} f_j + f_{n+1} = a\,(u * f)_n + f_{n+1};$$

o consequently, $x_n := u_n (x_0 - f_0) + (u * f)_n$ solves (8.1). 
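The closed form (8.2) can be checked numerically against direct iteration of (8.1). The following is a minimal sketch; the function names and the sample data are illustrative assumptions, not part of the text.

```python
def iterate_first_order(a, x0, f, n):
    """Iterate x_{k+1} = a*x_k + f[k+1] (recurrence (8.1)) and return x_n."""
    x = x0
    for k in range(n):
        x = a * x + f[k + 1]
    return x


def closed_form_first_order(a, x0, f, n):
    """Formula (8.2): x_n = a^n x_0 + sum_{j=1}^n a^(n-j) f_j."""
    return a**n * x0 + sum(a**(n - j) * f[j] for j in range(1, n + 1))


# Hypothetical sample data; f[0] is not used by (8.1).
f_sample = [0, 1, 4, 9, 16, 25]
a_sample, x0_sample = 3, 2
pairs = [
    (iterate_first_order(a_sample, x0_sample, f_sample, n),
     closed_form_first_order(a_sample, x0_sample, f_sample, n))
    for n in range(len(f_sample))
]
```

With integer data both computations are exact, so the two entries of each pair should coincide.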

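Similarly, we easily see that first order linear difference equations with 
varying coefficients are uniquely solvable, and we have the following. 

8.2 Proposition. Let $\{a_n\}$ and $\{f_n\}$ be two sequences (with $a_n \ne 0$ for all $n$, so that the quotients below make sense). Then 

(i) The sequence $u := \{u_n\}$, 

$$u_0 = 1, \qquad u_n = \prod_{j=1}^{n} a_j, \quad n \ge 1, \qquad \text{solves} \qquad \begin{cases} x_0 = 1,\\ x_{n+1} = a_{n+1} x_n, \quad n \ge 0.\end{cases}$$

(ii) The sequence $\{x_n\}$, $x_n := u_n \sum_{j=0}^{n} \frac{f_j}{u_j}$, solves 

$$\begin{cases} x_0 = f_0,\\ x_{n+1} = a_{n+1} x_n + f_{n+1}, \quad n \ge 0.\end{cases}$$

(iii) Consequently, $\{x_n\}$, 

$$x_n := u_n (x_0 - f_0) + u_n \sum_{j=0}^{n} \frac{f_j}{u_j},$$

solves 

$$\begin{cases} x_0 \text{ given},\\ x_{n+1} = a_{n+1} x_n + f_{n+1}, \quad n \ge 0.\end{cases}$$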
b. Second order homogeneous difference equations 

A generic second order homogeneous linear difference equation with constant coefficients has the form 

$$a x_{n+2} + b x_{n+1} + c x_n = 0, \tag{8.3}$$

where $a$, $b$ and $c$ are constants and $a \ne 0$. We are interested in finding all 
solutions of (8.3), or, equivalently, in solving explicitly the recurrence 

$$\begin{cases} x_0 = \alpha, \quad x_1 = \beta,\\ a x_{n+2} + b x_{n+1} + c x_n = 0, \quad n \ge 0,\end{cases} \tag{8.4}$$

for any given $\alpha$ and $\beta \in \mathbb{R}$. As in the case of ODEs (compare, e.g., [GM1]), 

(i) if $\{x_n\}$ and $\{y_n\}$ solve (8.3), then also $\{c_1 x_n + c_2 y_n\}_n$ solves (8.3) for 
any $c_1, c_2 \in \mathbb{C}$, 

(ii) if $\lambda$ is a solution of the characteristic equation 

$$a\lambda^2 + b\lambda + c = 0,$$

then $\{\lambda^n\}$ solves (8.3): in fact, 

$$a\lambda^{n+2} + b\lambda^{n+1} + c\lambda^n = \lambda^n (a\lambda^2 + b\lambda + c) = 0.$$

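Let $\lambda_1, \lambda_2$ be the two solutions of the characteristic equation. Corresponding 
to the three cases of real and distinct roots, repeated real roots 
and conjugate complex roots, define the two sequences $\{u_n\}$, $\{v_n\}$ by 

$$\begin{array}{lll} u_n = \lambda_1^n, & v_n = \lambda_2^n & \text{if } \lambda_1, \lambda_2 \in \mathbb{R},\ \lambda_1 \ne \lambda_2,\\ u_n = \lambda_1^n, & v_n = n\lambda_1^n & \text{if } \lambda_1 = \lambda_2,\\ u_n = |\lambda_1|^n \cos(n\varphi), & v_n = |\lambda_1|^n \sin(n\varphi) & \text{otherwise}. \end{array} \tag{8.5}$$

The latter occurs if $\lambda_1, \lambda_2$ are complex conjugate, and in this case we 
have set 

$$\lambda_1 = |\lambda_1| e^{i\varphi}.$$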

We have the following. 

8.3 Proposition. The solution of (8.4) is the sequence $\{x_n\}$ given by 
$x_n = c_1 u_n + c_2 v_n$, where $c_1, c_2$ solve 

$$c_1 u_0 + c_2 v_0 = \alpha, \qquad c_1 u_1 + c_2 v_1 = \beta. \tag{8.6}$$

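Proof. (i) Real and distinct roots. By linearity $\{x_n\}$, $x_n = c_1\lambda_1^n + c_2\lambda_2^n$, 
solves (8.3). Moreover, since $\lambda_1 \ne \lambda_2$, for any $\alpha, \beta \in \mathbb{R}$ one can solve 
the system (8.6) for $c_1, c_2$. 

(ii) Complex conjugate roots. Let $\lambda := \lambda_1$ and $\bar\lambda = \lambda_2$. Similarly to (i), all 
complex valued sequences $\{x_n\}$, $x_n = c_1\lambda^n + c_2\bar\lambda^n$, $c_1, c_2 \in \mathbb{C}$, solve (8.3). 
For given $\alpha, \beta$ we then solve in $\mathbb{C}$, since $\lambda \ne \bar\lambda$, the system corresponding to (8.6), $c_1 + c_2 = \alpha$, $c_1\lambda + c_2\bar\lambda = \beta$, to get the solution 

$$x_n = c_1\lambda^n + c_2\bar\lambda^n$$

of (8.4), a priori a complex valued sequence. 

But $\alpha, \beta$ being real, when solving the system we get $c_2 = \overline{c_1}$, hence the 
solution found is real and can be written as 

$$2\,\Re(c_1\lambda^n),$$

or, setting $2c_1 = a - ib$ with $a, b \in \mathbb{R}$ (new real constants, unrelated to the coefficients of (8.3)), by de Moivre's formula, 

$$2\,\Re(c_1\lambda^n) = a\,\Re(\lambda^n) + b\,\Im(\lambda^n) = a\,|\lambda|^n \cos(n\varphi) + b\,|\lambda|^n \sin(n\varphi).$$

(iii) Repeated real roots. Let $\lambda_1 = \lambda_2 = \lambda$. Since in this case $2a\lambda + b = 0$, 
we have 

$$a(n+2)\lambda^{n+2} + b(n+1)\lambda^{n+1} + c\,n\lambda^n = n\lambda^n (a\lambda^2 + b\lambda + c) + \lambda^{n+1}(2a\lambda + b) = 0,$$

i.e., $\{n\lambda^n\}$ solves (8.3). Therefore all sequences $\{x_n\}$, $x_n = \lambda^n(c_1 + c_2 n) = 
c_1 u_n + c_2 v_n$, $c_1, c_2 \in \mathbb{R}$, solve (8.3). Since the system (8.6) yields $c_1$ and 
$c_2$, we find the solution of (8.4). □ 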
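The recipe of Proposition 8.3 can be sketched in code as follows. This sketch works with complex arithmetic throughout, as in part (ii) of the proof, instead of the cosine and sine forms of (8.5); the function names, the tolerance used to detect a repeated root, and the assumption that a repeated root is nonzero are illustrative choices, not taken from the text.

```python
import cmath


def solve_second_order(a, b, c, alpha, beta, n):
    """x_n for a*x_{k+2} + b*x_{k+1} + c*x_k = 0, x_0 = alpha, x_1 = beta,
    computed from the roots of the characteristic equation."""
    disc = cmath.sqrt(b * b - 4 * a * c)
    l1, l2 = (-b + disc) / (2 * a), (-b - disc) / (2 * a)
    if abs(l1 - l2) > 1e-12:
        # Distinct roots (real or complex conjugate): x_n = c1*l1^n + c2*l2^n.
        c2 = (beta - alpha * l1) / (l2 - l1)
        c1 = alpha - c2
        return c1 * l1**n + c2 * l2**n
    # Repeated root l (assumed nonzero here): x_n = l^n * (c1 + c2*n).
    c1 = alpha
    c2 = beta / l1 - alpha
    return l1**n * (c1 + c2 * n)


def iterate_second_order(a, b, c, alpha, beta, n):
    """The same x_n obtained by iterating the recurrence directly."""
    x, y = alpha, beta
    for _ in range(n):
        x, y = y, -(b * y + c * x) / a
    return x
```

For real data the imaginary part of the returned value is zero up to rounding, so one would typically read off `.real`; for instance the coefficients `(1, 0, 1)` with `alpha = 1`, `beta = 0` give the sequence cos(n pi/2).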



c. Second order nonhomogeneous difference equations 

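Consider the recurrence 

$$\begin{cases} x_0 = \alpha, \quad x_1 = \beta,\\ a x_{n+2} + b x_{n+1} + c x_n = f_{n+1}, \quad n \ge 0,\end{cases} \tag{8.7}$$

where $a, b, c \in \mathbb{R}$, $a \ne 0$, and $\{f_n\}$ is a given sequence. 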



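8.4 Proposition. Let $\{w_n\}$ be the sequence that solves the homogeneous 
recurrence 

$$\begin{cases} w_0 = 0, \quad w_1 = 1,\\ a w_{n+2} + b w_{n+1} + c w_n = 0, \quad n \ge 0.\end{cases} \tag{8.8}$$

Then the sequence $\{x_n\}$ given by 

$$x_n := \frac{1}{a}\,(w * f)_n = \frac{1}{a} \sum_{j=0}^{n} w_{n-j} f_j$$

solves 

$$\begin{cases} x_0 = 0, \quad x_1 = f_0/a,\\ a x_{n+2} + b x_{n+1} + c x_n = f_{n+1}, \quad n \ge 0.\end{cases}$$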
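Proof. In fact, since $w_0 = 0$, $w_1 = 1$ and $\{w_n\}$ solves (8.8), 

$$\begin{aligned} a \sum_{j=0}^{n+2} w_{n+2-j} f_j &+ b \sum_{j=0}^{n+1} w_{n+1-j} f_j + c \sum_{j=0}^{n} w_{n-j} f_j\\ &= a w_0 f_{n+2} + a w_1 f_{n+1} + b w_0 f_{n+1} + \sum_{j=0}^{n} (a w_{n+2-j} + b w_{n+1-j} + c w_{n-j}) f_j\\ &= a f_{n+1}. \end{aligned}$$

□ 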

By linearity, with the notation of Propositions 8.3 and 8.4 we conclude 

8.5 Theorem. The solutions of the linear second order recurrence 

$$a x_{n+2} + b x_{n+1} + c x_n = f_{n+1}$$

are given by the two-parameter family of sequences 

$$x_n = c_1 u_n + c_2 v_n + \frac{1}{a}\,(w * f)_n, \qquad c_1, c_2 \in \mathbb{R},$$

where $\{u_n\}$ and $\{v_n\}$ are defined in (8.5) and $\{w_n\}$ in (8.8). 
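The particular solution of Proposition 8.4 can be verified on a small example. The sketch below uses exact rational arithmetic; the helper names and the sample coefficients are assumptions made only for illustration.

```python
from fractions import Fraction


def homogeneous_w(a, b, c, n_max):
    """w_0 = 0, w_1 = 1, a*w_{n+2} + b*w_{n+1} + c*w_n = 0 (recurrence (8.8))."""
    w = [Fraction(0), Fraction(1)]
    for n in range(n_max - 1):
        w.append(-(b * w[n + 1] + c * w[n]) / Fraction(a))
    return w


def particular_solution(a, b, c, f):
    """x_n = (1/a) * sum_{j=0}^n w_{n-j} f_j for n = 0, ..., len(f) - 1."""
    w = homogeneous_w(a, b, c, len(f) - 1)
    return [
        sum(w[n - j] * f[j] for j in range(n + 1)) / Fraction(a)
        for n in range(len(f))
    ]


# Hypothetical sample data.
a_ex, b_ex, c_ex = 2, -3, 1
f_ex = [1, 2, 3, 4, 5, 6]
x_ex = particular_solution(a_ex, b_ex, c_ex, f_ex)
residuals = [
    a_ex * x_ex[n + 2] + b_ex * x_ex[n + 1] + c_ex * x_ex[n] - f_ex[n + 1]
    for n in range(len(f_ex) - 2)
]
```

All entries of `residuals` should vanish, and the first two values of `x_ex` should be 0 and `f_ex[0] / a_ex`, as stated in the proposition.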
d. Z-transform and Laplace transform 

Linear difference equations can be solved also using the method of generating 
functions or, better, a slight modification of it known as the Z-transform, 
see 8.7 below. 

Let $a = \{a_n\}$ be a sequence of complex numbers which grows at most 
exponentially, $|a_n| \le C M^n$ for some $M > 0$. The Z-transform of $\{a_n\}$ is 
the complex-valued function 

$$Z\{a\}(z) := \sum_{n=0}^{\infty} \frac{a_n}{z^n}$$

that is defined at least in $\{z \mid |z| > M\}$. Of course 

$$Z\{a\}(z) = T\{a\}\Big(\frac{1}{z}\Big),$$

$T\{a\}$ being the generating function of $a = \{a_n\}$. Using properties of power 
series, we see that 

(i) $Z\{a\}$ uniquely determines $\{a_n\}$, 

(ii) $Z$ is linear, i.e., if $\lambda, \mu \in \mathbb{C}$ and $a = \{a_n\}$, $b = \{b_n\}$ grow at most 
exponentially, then 

$$Z\{\lambda a + \mu b\}(z) = \lambda Z\{a\}(z) + \mu Z\{b\}(z), \qquad |z| \text{ large}.$$
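(iii) If $\delta_k := (\underbrace{0, \dots, 0}_{k}, 1, 0, 0, \dots)$ is the Kronecker sequence, then 

$$Z\{\delta_k\}(z) = \frac{1}{z^k}.$$

(iv) If $a = \{a_n\}$, and 

$$T_k\{a\} := (\underbrace{0, \dots, 0}_{k}, a_0, a_1, a_2, \dots)$$

is the forward shift by $k$ places, then 

$$Z\{T_k\{a\}\}(z) = \sum_{n=k}^{\infty} \frac{a_{n-k}}{z^n} = \frac{1}{z^k}\, Z\{a\}(z).$$

(v) If $a = \{a_n\}$, and $T_{-k}\{a\} := \{a_{n+k}\}_n$ is the backward shift by $k$ places, 
then 

$$Z\{T_{-k}\{a\}\}(z) = \sum_{n=0}^{\infty} \frac{a_{n+k}}{z^n} = z^k \Big( Z\{a\}(z) - a_0 - \frac{a_1}{z} - \cdots - \frac{a_{k-1}}{z^{k-1}} \Big).$$

(vi) $Z$ transforms the convolution product of sequences into the product 
of the transformed functions, see Theorem 7.34, 

$$Z\{a * b\}(z) = \sum_{n=0}^{\infty} \frac{(a * b)_n}{z^n} = \Big( \sum_{n=0}^{\infty} \frac{a_n}{z^n} \Big) \Big( \sum_{n=0}^{\infty} \frac{b_n}{z^n} \Big) = Z\{a\}(z)\, Z\{b\}(z).$$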

The notion of Z-transform (and of generating function) is very useful 
in several fields: in combinatorics, as we have seen, in probability, in data 
sampling, in the study of digital filters, just to mention a few. The 
Z-transform is known, especially to engineers, as the discrete version of the 
Laplace transform, which is particularly useful when studying the Cauchy 
problem for linear ODEs. 

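The Laplace transform of a continuous function that grows at infinity 
less than exponentially is defined by 

$$\mathcal{L}\{f\}(z) := \int_0^{\infty} f(t)\, e^{-zt}\, dt, \qquad \Re z > 0.$$

If $f$ is the piecewise constant function defined by $f(t) = a_n$ if $n \le t < n+1$, 
then 

$$\mathcal{L}\{f\}(z) = \sum_{n=0}^{\infty} a_n \int_n^{n+1} e^{-zt}\, dt = \frac{1 - e^{-z}}{z} \sum_{n=0}^{\infty} a_n e^{-nz} = \frac{1 - e^{-z}}{z}\, Z\{a\}(e^z).$$

Also, if for $0 < h \le 1$ we set 

$$f_h(t) := \begin{cases} a_n & \text{if } n \le t < n + h,\\ 0 & \text{otherwise}, \end{cases}$$

then 

$$\mathcal{L}\{f_h\}(z) = \frac{1 - e^{-hz}}{z}\, Z\{a\}(e^z),$$

therefore 

$$\frac{1}{h}\, \mathcal{L}\{f_h\}(z) \to Z\{a\}(e^z) \qquad \text{as } h \to 0^+.$$

The functions $(1/h) f_h$ may be thought of as approximations of "impulses" 
concentrated at the integers. 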



e. Fibonacci’s numbers 

The simplest possible recurrence in which each number depends on the 
previous two is the one defining Fibonacci’s numbers. They occur in a 
wide variety of situations. Here is how Leonardo Pisano (1170-1250), called 
Fibonacci, came to them. 

Assume that every month every couple of rabbits gives birth to a couple 
of rabbits that can reproduce from their second month of life on. How many 
couples of rabbits are there after $n$ months if we start with a newborn 
couple? If $f_n$ is this number, then of course $f_1 = f_2 = 1$; moreover, the 
couples living at the $(n+2)$-th month are the ones of the previous month 
plus those generated by the couples that were alive two months before: 

$$f_{n+2} = f_{n+1} + f_n.$$

Fibonacci's sequence $\{f_n\}$ is defined by 

$$\begin{cases} f_0 = 0, \quad f_1 = 1,\\ f_{n+2} = f_{n+1} + f_n, \quad n \ge 0.\end{cases}$$

If we write the characteristic equation 

$$z^2 - z - 1 = 0$$

of the recurrence to get its solutions, $\lambda = \frac{1+\sqrt5}{2}$ and $\mu = \frac{1-\sqrt5}{2}$, we 
conclude on account of Section 8.1.1 that 

$$f_n = c_1\lambda^n + c_2\mu^n, \qquad n \ge 0,$$

where $c_1, c_2$ solve 

$$f_0 = c_1 + c_2 = 0, \qquad f_1 = c_1\lambda + c_2\mu = 1.$$

Therefore $c_1 = -c_2 = 1/\sqrt5$. 
Figure 8.3. Frontispieces of the first printed edition and the first English translation of 
Liber Abaci of Leonardo Pisano (1170-1250), called Fibonacci. 



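8.6 Proposition (Binet's formula). We have 

$$f_n = \frac{1}{\sqrt5} \left[ \left( \frac{1+\sqrt5}{2} \right)^n - \left( \frac{1-\sqrt5}{2} \right)^n \right] \qquad \forall n \ge 0.$$

Since $|\mu|^n/\sqrt5 \in\, ]0, 1/2[$, and $\mu^n$ is negative if $n$ is odd and positive if $n$ 
is even, $f_n$ is the integer part of $\lambda^n/\sqrt5$ if $n$ is even, and the integer part of 
$\lambda^n/\sqrt5$ plus 1 if $n$ is odd. In any case, $f_n$ is the closest integer to $\lambda^n/\sqrt5$. 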
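The rounding statements after Binet's formula are easy to test in floating point for moderate $n$. The sketch below is an illustration only: the names are hypothetical, and the range is kept small because double precision cannot resolve the distance $|\mu|^n/\sqrt5$ for large $n$.

```python
from math import floor, sqrt


def fib(n):
    """f_n from the recurrence f_0 = 0, f_1 = 1, f_{n+2} = f_{n+1} + f_n."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a


LAMBDA = (1 + sqrt(5)) / 2


def binet_nearest(n):
    """Closest integer to lambda^n / sqrt(5)."""
    return round(LAMBDA**n / sqrt(5))


def binet_floor(n):
    """Integer part of lambda^n / sqrt(5), plus 1 when n is odd."""
    return floor(LAMBDA**n / sqrt(5)) + (n % 2)


checks = [(fib(n), binet_nearest(n), binet_floor(n)) for n in range(25)]
```

Each triple in `checks` should consist of three equal integers.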



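8.7 Fibonacci's numbers by Z-transform. One can solve the Fibonacci 
recurrence also using the Z-transform. In fact, multiplying the $n$-th 
recurrence relation by $1/z^n$ and summing, we get 

$$\sum_{n=0}^{\infty} \frac{f_{n+2}}{z^n} - \sum_{n=0}^{\infty} \frac{f_{n+1}}{z^n} - \sum_{n=0}^{\infty} \frac{f_n}{z^n} = 0,$$

that is 

$$z^2 \Big( Z\{f\}(z) - f_0 - \frac{f_1}{z} \Big) - z \big( Z\{f\}(z) - f_0 \big) - Z\{f\}(z) = 0,$$

i.e., 

$$Z\{f\}(z) = \frac{z}{z^2 - z - 1}.$$

Notice that the denominator of $Z\{f\}(z)$ is the characteristic polynomial of 
the Fibonacci recurrence. By Hermite's decomposition formula 

$$\frac{1}{z^2 - z - 1} = A\, \frac{1}{z - \lambda} + B\, \frac{1}{z - \mu}$$

with 

$$A := \frac{1}{2\lambda - 1} = \frac{1}{\sqrt5}, \qquad B := \frac{1}{2\mu - 1} = -\frac{1}{\sqrt5},$$

and consequently, recalling that $\frac{1}{1-w} = \sum_{n=0}^{\infty} w^n$ for $|w| < 1$, 

$$Z\{f\}(z) = \frac{z}{z^2 - z - 1} = A\, \frac{1}{1 - \lambda/z} + B\, \frac{1}{1 - \mu/z} = \sum_{n=0}^{\infty} \frac{A\lambda^n + B\mu^n}{z^n}, \qquad |z| > \max(|\lambda|, |\mu|).$$

Hence, we find again 

$$f_n = \frac{\lambda^n - \mu^n}{\sqrt5}.$$

In terms of Fibonacci numbers one can give a sharp estimate of the 
number of steps needed to end Euclid's algorithm. 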



8.8 Proposition. Let $a, b \in \mathbb{N}$ with $0 < b < a$. If Euclid's algorithm on $a$ 
and $b$ ends in $n$ steps, then $a \ge f_{n+2}$ and $b \ge f_{n+1}$. 

In other words, if $b < f_{n+1}$ or $a < f_{n+2}$, then Euclid's algorithm ends in 
at most $n - 1$ steps. 

Proof. Write Euclid's algorithm as 

$$r_{-1} = a, \quad r_0 = b, \qquad r_{j+1} = r_{j-1} - q_j r_j,$$

until $r_{n+1} = 0$, so that Euclid's algorithm has $n$ steps. Observe that 

$$r_{n+1-j} \ge f_j \qquad \forall j \in \{0, 1, \dots, n+2\}.$$

In fact, we have $r_{n+1} = 0 = f_0$, $r_n = \text{g.c.d.}(a, b) \ge 1 = f_1$, and by induction 

$$r_{n+1-j} = q_{n+2-j}\, r_{n+2-j} + r_{n+3-j} \ge r_{n+2-j} + r_{n+3-j} \ge f_{j-1} + f_{j-2} = f_j.$$

We infer 

$$a = r_{-1} \ge f_{n+2}, \qquad \text{and} \qquad b = r_0 \ge f_{n+1}.$$

□ 

Notice that the estimate on the number of steps of Euclid's algorithm 
in Proposition 8.8 is sharp, since for $a = f_{n+2}$ and $b = f_{n+1}$ we have 

$$r_1 = f_n, \quad r_2 = f_{n-1}, \quad \dots, \quad \text{g.c.d.}(f_{n+2}, f_{n+1}) = r_n = f_1 = 1, \quad r_{n+1} = f_0 = 0.$$

Thus Euclid's algorithm stops in $n$ steps. 



8.9 Corollary (Lamé). The number of steps needed to end Euclid's 
algorithm does not exceed five times the number of the (decimal) digits of 
the divisor. 
Figure 8.4. 



Proof. Let $n$ be the number of steps of Euclid's algorithm to divide $a$ by $b$, and let $k$ 
be the number of digits of the divisor. From Proposition 8.8, 

$$10^k > b \ge f_{n+1}.$$

On the other hand, it is easily seen by induction that, for $n \ge 2$, 

$$f_{n+1} > \Big( \frac{8}{5} \Big)^{n-1}.$$

Since $(8/5)^5 > 10$ we have 

$$10^{5k} > f_{n+1}^5 > \Big( \frac{8}{5} \Big)^{5(n-1)} > 10^{n-1},$$

that is $n - 1 < 5k$. 

□ 
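Proposition 8.8 and Lamé's corollary can be explored experimentally. In the sketch below a "step" is taken to mean one division with remainder, which is an assumption about the counting convention; with this convention the classical form of Lamé's theorem gives exactly the bounds $b \ge f_{n+1}$, $a \ge f_{n+2}$. All names are illustrative.

```python
def fib(n):
    """f_n with f_0 = 0, f_1 = 1."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a


def euclid_steps(a, b):
    """Number of divisions with remainder performed by Euclid's algorithm."""
    steps = 0
    while b != 0:
        a, b = b, a % b
        steps += 1
    return steps


def lame_bound_holds(a, b):
    """Check steps <= 5 * (number of decimal digits of the divisor b)."""
    return euclid_steps(a, b) <= 5 * len(str(b))


# Consecutive Fibonacci numbers are the extremal inputs.
worst_case_steps = [euclid_steps(fib(n + 2), fib(n + 1)) for n in range(1, 15)]
```

Here `worst_case_steps` should be the list 1, 2, ..., 14, which illustrates the sharpness remark following Proposition 8.8.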



8.10 The number $\tau = \frac{1+\sqrt5}{2}$ is the golden ratio^ of Greek geometers. With reference 
to Figure 8.4, show that, if 1 is the side of the pentagon, then 

(i) the length of the diagonal is the golden ratio $\tau$, 

(ii) the side of the internal pentagon is $\tau^{-2}$. 



8.1.2 Some nonlinear examples 

a. Simple examples 

8.11 Example. Consider the recurrence 

$$\begin{cases} x_0 = \alpha > 0,\\ x_{n+1} = \sqrt{x_n}, \quad n \ge 0.\end{cases} \tag{8.9}$$

If $\alpha = 1$, then $x_n = 1$ $\forall n$. If $\alpha > 1$, we see by induction that $x_n > 1$ $\forall n$, hence 
$x_{n+1} = \sqrt{x_n} < x_n$ $\forall n$, i.e., $\{x_n\}$ is decreasing, therefore $x_n \to L$ and $1 \le L < \alpha$; 
actually, passing to the limit in (8.9), we see that $L = \sqrt{L}$, i.e., $L = 1$. Similarly, if 
$\alpha < 1$, then $x_n < 1$ $\forall n$, $\{x_n\}$ is increasing and $x_n \to 1$. We conclude that for any 
$\alpha > 0$, the sequence $\{x_n\}$ defined by (8.9) converges to 1. 

Alternatively, it is easily guessed and proved that the sequence defined by (8.9) is 

$$x_n = \alpha^{1/2^n}, \quad n \ge 0, \qquad \text{thus } x_n \to 1.$$

^ The golden ratio is the inverse of the golden mean, which is the proportion of the 
division of a segment so that the smaller is to the larger as the larger is to the whole. 






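8.12 Example. Consider the recurrence 

$$\begin{cases} x_0 = \alpha > 0,\\ x_{n+1} = \dfrac{1}{\sqrt{x_n}}, \quad n \ge 0.\end{cases} \tag{8.10}$$

If $\alpha = 1$, then $x_n = 1$ $\forall n$. Also, if $\alpha > 1$ we have $x_n > 1$ if $n$ is even and $x_n < 1$ if $n$ is 
odd. Moreover 

$$x_{2n+2} = \frac{1}{\sqrt{x_{2n+1}}} = \sqrt[4]{x_{2n}}.$$

We deduce $x_{2n} = \alpha^{1/4^n} \to 1$, hence 

$$x_{2n+1} = \frac{1}{\sqrt{x_{2n}}} \to 1,$$

concluding that $x_{2n}, x_{2n+1} \to 1$ and $x_n \to 1$, since even and odd integers exhaust all 
integers. 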



8.13 Example. A limit situation occurs for the recurrence 

$$\begin{cases} x_0 = \alpha > 0,\\ x_{n+1} = \dfrac{1}{x_n}, \quad n \ge 0.\end{cases}$$

Clearly $x_n = \alpha$ if $n$ is even and $x_n = 1/\alpha$ if $n$ is odd. We conclude that $\{x_n\}$ has a limit 
if and only if $\alpha = 1$. 
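The three behaviors of Examples 8.11, 8.12 and 8.13 can be observed by iterating the maps numerically. This is only an illustrative sketch: the helper `orbit`, the starting value and the numbers of iterations are arbitrary choices.

```python
from math import sqrt


def orbit(g, x0, n):
    """Return [x_0, x_1, ..., x_n] for the recurrence x_{k+1} = g(x_k)."""
    xs = [x0]
    for _ in range(n):
        xs.append(g(xs[-1]))
    return xs


alpha = 5.0
sqrt_orbit = orbit(sqrt, alpha, 60)                       # Example 8.11
inv_sqrt_orbit = orbit(lambda x: 1 / sqrt(x), alpha, 60)  # Example 8.12
inv_orbit = orbit(lambda x: 1 / x, alpha, 6)              # Example 8.13
```

The first orbit decreases to 1, the second oscillates around 1 while converging to it, and the third alternates between `alpha` and `1 / alpha` without converging.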



b. Evaluating algorithm performance 

In evaluating the performance of algorithms one considers a characteristic 
time as a function $T(n)$ of a parameter $n$ which describes the size of the data 
on which the algorithm works. Often, due to the structure of the algorithm, 
one gets recursive estimates on $T(n)$ of the type® 

$$T(2n) \le 2T(n) + n, \qquad \forall n.$$

One can prove that this estimate yields the estimate 
$T(n) \le C n \log n$, i.e., using the Landau notation, 

$$T(n) = O(n \log n).$$

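8.14 Proposition. Let $T : \mathbb{N} \to \mathbb{R}$ be a positive increasing function such 
that 

$$T(rn) \le r^{\alpha} T(n) + B n^{\beta}, \qquad \forall n > 0, \tag{8.11}$$

where $r \in \mathbb{N}$, $r \ge 2$, $B > 0$, $\alpha > 0$ and $\beta > 0$ are independent of $n$. Then 

(i) if $\alpha \ne \beta$, then $T(n) = O(n^{\max(\alpha,\beta)})$, i.e., there exists a constant 
$C = C(\alpha, \beta, r, T(1), B)$ such that $T(n) \le C n^{\max(\alpha,\beta)}$ $\forall n$, 

(ii) if $\alpha = \beta$, then $T(n) = O(n^{\alpha} \log n)$, i.e., $T(n) \le C n^{\alpha} \log n$ $\forall n \ge 2$ for 
a suitable constant $C$ depending on $T(1)$, $B$, $\alpha$ and $r$. 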



® See, e.g., A. V. Aho, J. E. Hopcroft and J. D. Ullman, Data Structures and Algorithms, 
Addison-Wesley, 1983. 
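Proof. We deduce from (8.11) 

$$T(r) \le r^{\alpha} T(1) + B,$$

$$T(r^2) \le r^{\alpha} T(r) + B r^{\beta} \le r^{2\alpha} T(1) + B r^{\alpha} + B r^{\beta},$$

and inductively 

$$T(r^{k+1}) \le r^{(k+1)\alpha} T(1) + B r^{k\beta} \sum_{j=0}^{k} r^{(\alpha-\beta)j} \qquad \forall k. \tag{8.12}$$

(i) If $\alpha < \beta$, (8.12) yields 

$$T(r^{k+1}) \le r^{(k+1)\alpha} T(1) + B r^{k\beta}\, \frac{1 - r^{(\alpha-\beta)(k+1)}}{1 - r^{\alpha-\beta}} \le C r^{k\beta},$$

where $C := r^{\beta} T(1) + \dfrac{B}{1 - r^{\alpha-\beta}}$. 

Given $n \in \mathbb{N}$, we can choose $k$ in such a way that $r^k \le n < r^{k+1}$ and conclude 

$$T(n) \le T(r^{k+1}) \le C r^{k\beta} \le C n^{\beta},$$

since $T(n)$ is increasing. Similarly, if $\alpha > \beta$, (8.12) yields 

$$T(r^{k+1}) \le r^{(k+1)\alpha} T(1) + B r^{k\beta}\, \frac{r^{(\alpha-\beta)(k+1)} - 1}{r^{\alpha-\beta} - 1} \le C r^{k\alpha},$$

where $C := r^{\alpha} T(1) + B\, \dfrac{r^{\alpha-\beta}}{r^{\alpha-\beta} - 1}$, and, as before, $T(n) \le C n^{\alpha}$. Thus (i) is proved. 

(ii) If $\alpha = \beta$, (8.12) yields 

$$T(r^{k+1}) \le r^{(k+1)\alpha} T(1) + B r^{k\alpha} (k+1) = r^{k\alpha} \big( r^{\alpha} T(1) + B(k+1) \big),$$

hence, if $k$ is chosen in such a way that $r^k \le n < r^{k+1}$, i.e., $k \le \log_r n < k+1$, we find 

$$T(n) \le T(r^{k+1}) \le r^{k\alpha} \big( r^{\alpha} T(1) + B(\log_r n + 1) \big) \le n^{\alpha} \big( r^{\alpha} T(1) + B + B \log_r n \big),$$

$T(n)$ being increasing, hence, 

$$T(n) \le C n^{\alpha} \log n \qquad \forall n \ge 2,$$

for a suitable constant $C$. □ 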
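Case (ii) of Proposition 8.14 can be illustrated with the model recurrence $T(2n) = 2T(n) + n$, that is $r = 2$, $\alpha = \beta = 1$, $B = 1$. In the sketch below the initial value $T(1) = 1$ and the restriction to powers of two are assumptions made to keep the example short.

```python
def T(n):
    """Model cost with T(1) = 1 and T(2m) = 2*T(m) + m, for n a power of two."""
    if n == 1:
        return 1
    half = n // 2
    return 2 * T(half) + half


# Ratio T(n) / (n * log2(n)) along n = 2^k; it stays bounded, as (ii) predicts.
ratios = [T(2**k) / (2**k * k) for k in range(1, 21)]
```

Unrolling the recurrence gives $T(2^k) = 2^{k-1}(k+2)$, so the ratios equal $(k+2)/(2k)$ and decrease towards $1/2$.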

8.15 Quicksort. The average number of comparison steps $C_n$ made by the Quicksort 
algorithm for sorting $n$ random elements is given by 

$$C_0 = 0, \qquad C_n = n + 1 + \frac{2}{n} \sum_{j=0}^{n-1} C_j, \quad n \ge 1.$$

Multiplying by $n$ we get for $n \ge 1$, 

$$\begin{cases} n C_n = n^2 + n + 2 \sum_{j=0}^{n-1} C_j,\\ (n+1) C_{n+1} = (n+1)^2 + (n+1) + 2 \sum_{j=0}^{n} C_j, \end{cases}$$

and subtracting the first relation from the second 

$$C_{n+1} = \frac{n+2}{n+1}\, C_n + 2.$$

This is written, for $s_n := C_n/(n+1)$, as 

$$\begin{cases} s_0 = 0,\\ s_{n+1} = s_n + 2/(n+2), \end{cases}$$

i.e., $s_n = \sum_{j=1}^{n} \frac{2}{j+1}$. According to the above 

$$C_n = (n+1) s_n = 2(n+1) \sum_{j=1}^{n} \frac{1}{j+1} = 2(n+1)(H_{n+1} - 1),$$

where $H_n := \sum_{j=1}^{n} 1/j$ is the $n$-th partial harmonic sum. Consequently (6.21) yields 

$$C_n \sim 2(n+1) \log(n+1) \quad \text{as } n \to \infty, \qquad \text{or} \qquad C_n = O(n \log n).$$

Figure 8.5. 
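The closed form for the Quicksort averages can be compared with the defining recurrence in exact arithmetic. The function names below are hypothetical; the sketch only restates the two formulas of 8.15.

```python
from fractions import Fraction


def quicksort_comparisons(n_max):
    """C_0, ..., C_{n_max} from C_n = n + 1 + (2/n) * sum_{j<n} C_j."""
    C = [Fraction(0)]
    for n in range(1, n_max + 1):
        C.append(n + 1 + Fraction(2, n) * sum(C))
    return C


def closed_form(n):
    """C_n = 2 (n + 1) (H_{n+1} - 1), with H_m the m-th harmonic sum."""
    harmonic = sum(Fraction(1, j) for j in range(1, n + 2))
    return 2 * (n + 1) * (harmonic - 1)


C_values = quicksort_comparisons(30)
```

For instance the first values are $C_1 = 2$ and $C_2 = 5$, and every entry of `C_values` should agree with `closed_form`.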



c. Rate of convergence 

8.16 Newton's approximation method. Let $f : [a, b] \to \mathbb{R}$ be a continuous 
function with $f(a) < 0$ and $f(b) > 0$. Theorem 2.51 states that 
$f$ has at least one zero, and the proof provides an algorithm to approximate 
that zero. In the case in which $f$ is also convex or concave, Newton's 
method turns out to be more efficient. 

Assume $f$ convex and differentiable in $[a, b]$, $f(a) < 0$, $f(b) > 0$, so that $f$ has 
a unique zero $c \in\, ]a, b[$, and (see, e.g., Chapter 4 of [GM1]) 

$$f(x) > 0, \qquad f'(x) \ge f'(c) > 0 \qquad \text{for } x \in\, ]c, b]. \tag{8.13}$$

Let $\{x_n\}$ be the sequence defined by the recurrence 

$$\begin{cases} x_0 = b,\\ x_{n+1} = x_n - \dfrac{f(x_n)}{f'(x_n)}.\end{cases}$$

Clearly $x_{n+1}$ is the point where the tangent to the graph of $f$ at the 
point $(x_n, f(x_n))$ intersects the $x$-axis. Since $f$ is convex, $c \le x_n$ $\forall n$ and, 
by (8.13), $x_{n+1} \le x_n$ $\forall n$. Thus the sequence $\{x_n\}$ converges to a point 
$L \in [c, b]$ that, as we see passing to the limit in the recurrence, is given by 

$$L = L - \frac{f(L)}{f'(L)},$$

i.e., $f(L) = 0$, i.e., $L = c$. 
$\{x_n\}$ is therefore a sequence of approximants of $c$ from above. A sequence 
of approximations $\{y_n\}$ from below can be obtained by defining $y_0 = a$ and 
$y_{n+1}$ as the point at which the secant through $(y_n, f(y_n))$ and $(x_n, f(x_n))$ 
intersects the $x$-axis. 

Notice that Heron's algorithm in Exercise 2.99 is Newton's method 
applied to $f(x) = x^2 - a$. 
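Newton's method for $f(x) = x^2 - a$, i.e., Heron's algorithm, gives a quick numerical picture of the monotone approximation from above. The sketch below is illustrative: the function names, the choice $a = 2$ and the starting point $x_0 = 2$ are assumptions.

```python
from math import sqrt


def newton(f, df, x0, steps):
    """Return [x_0, ..., x_steps] with x_{n+1} = x_n - f(x_n) / f'(x_n)."""
    xs = [x0]
    for _ in range(steps):
        x = xs[-1]
        xs.append(x - f(x) / df(x))
    return xs


a_val = 2.0
iterates = newton(lambda x: x * x - a_val, lambda x: 2 * x, 2.0, 6)
errors = [abs(x - sqrt(a_val)) for x in iterates]


def heron_step(x, a):
    """Heron's update (x + a/x) / 2, algebraically the same as the Newton step."""
    return (x + a / x) / 2
```

Up to rounding, the iterates decrease towards $\sqrt 2$, and in the first few steps each error is smaller than the square of the previous one.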

Let $g : [a, b] \to [a, b]$ be a function of class $C^1([a, b])$ and let $\{x_n\}$ be 
defined by the recurrence 

$$\begin{cases} x_0 := \alpha \in [a, b],\\ x_{n+1} = g(x_n).\end{cases} \tag{8.14}$$

If $\{x_n\}$ converges, then $x_n \to L \in [a, b]$ with $L = g(L)$, that is, $L$ is a 
fixed point of $g$. From Taylor's formula with Lagrange remainder (see, e.g., 
[GM1]), 

$$x_{n+1} - L = g(x_n) - g(L) = g'(\xi_n)(x_n - L), \qquad \xi_n \text{ between } x_n \text{ and } L,$$

we infer for the error $\delta_n := |x_n - L|$, 

$$\delta_{n+1} = |x_{n+1} - L| = |g(x_n) - g(L)| \le M |x_n - L| = M \delta_n,$$

where $M := \sup_{x \in [a,b]} |g'(x)|$. In particular, if $M < 1$, we have $\delta_n \le M^n \delta_0$ 
and $\{\delta_n\}$ decays exponentially to 0. 

If moreover $g$ is of class $C^2$ and $g'(L) = 0$, Taylor's formula with Lagrange remainder yields 
also 

$$g(x_n) = g(L) + \frac{1}{2}\, g''(\xi_n)(x_n - L)^2,$$

hence 

$$\delta_{n+1} = |x_{n+1} - L| = |g(x_n) - g(L)| \le N |x_n - L|^2 = N \delta_n^2,$$

where $N := \sup_{x \in [a,b]} |g''(x)|$. Therefore, we find for $p \in \mathbb{N}$ and $n \ge 1$, 

$$N \delta_{n+p} \le (N \delta_p)^{2^n}. \tag{8.15}$$

We say that a sequence $\{x_n\}$ converges rapidly to $L \in \mathbb{R}$ if $|x_n - L| \le \sigma^{2^n}$ 
eventually, with $\sigma < 1$. From the previous argument we then conclude the 
following. 

8.17 Proposition. Let $g \in C^2([a, b])$, let $L \in [a, b]$ be a fixed point 
of $g$, $g(L) = L$, with $g'(L) = 0$, and let $\{x_n\}$ be the sequence defined by (8.14). If 
$\liminf_{n \to \infty} |x_n - L| = 0$ (in particular if $\{x_n\}$ converges to $L$), then $\{x_n\}$ 
converges rapidly to $L$. 
Figure 8.6. A classical introduction to number theory: the frontispiece of An Introduction to the Theory of Numbers (Oxford, at the Clarendon Press). 


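In the case of Newton's approximating sequence 

$$\begin{cases} x_0 = \alpha \in [a, b],\\ x_{n+1} = x_n - \dfrac{f(x_n)}{f'(x_n)},\end{cases}$$

if $\{x_n\}$ converges, then the limit $L$ is a fixed point of the function 

$$g(y) := y - \frac{f(y)}{f'(y)}, \qquad y \in [a, b].$$

Assume moreover that $f \in C^3([a, b])$ and $f'(L) \ne 0$. Then $g$ is of class $C^2$ near $L$ 
and $g'(L) = f(L) f''(L)/f'(L)^2 = 0$. Therefore, if $\{x_n\}$ converges to $L$, then $\{x_n\}$ converges 
rapidly to $L$. 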

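8.18 Let $\{x_n\}$ be a sequence of positive real numbers such that 

$$x_{n+1} \le C B^n x_n^{1+\varepsilon}$$

with $C > 0$, $B > 1$ and $\varepsilon > 0$. Show that, if $x_0 \le C^{-1/\varepsilon} B^{-1/\varepsilon^2}$, then $x_n \le B^{-n/\varepsilon} x_0$, hence 
$x_n \to 0$. 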



8.1.3 Continued fractions 

a. Definitions and elementary properties 

The finite continued fraction operation consists in computing, starting from 
$n + 1$ nonnegative numbers $\{a_0, \dots, a_n\}$, which are all positive except possibly for 
$a_0$, the number 

$$a_0 + \cfrac{1}{a_1 + \cfrac{1}{a_2 + \cfrac{1}{a_3 + \cdots + \cfrac{1}{a_n}}}}.$$

One refers to the result as the (finite) continued fraction of $a_0, \dots, a_n$. 
Since the previous notation is heavy, one prefers a lighter notation, such as 

$$a_0 + \frac{1}{a_1 +}\, \frac{1}{a_2 +} \cdots \frac{1}{a_{n-1} +}\, \frac{1}{a_n},$$

or, as we shall do, 

$$[a_0, \dots, a_n].$$

The numbers $a_0, \dots, a_n$ are called the quotients of the continued fraction 
$[a_0, \dots, a_n]$, while for $0 \le k \le n$, the continued fraction $[a_0, \dots, a_k]$ is 
called the $k$-th convergent of $[a_0, \dots, a_n]$. 

Observe that $[a_0] = a_0$,^ $[a_0, a_1] = a_0 + \frac{1}{a_1}$ and more generally 

$$[a_0, \dots, a_n] = \Big[ a_0, \dots, a_{n-2}, a_{n-1} + \frac{1}{a_n} \Big] = [a_0, \dots, a_{n-2}, [a_{n-1}, a_n]],$$

$$[a_0, \dots, a_n] = a_0 + \frac{1}{[a_1, \dots, a_n]} = [a_0, [a_1, \dots, a_n]], \tag{8.16}$$

$$[a_0, \dots, a_n] = [a_0, \dots, a_k, [a_{k+1}, \dots, a_n]] \qquad \forall k,\ 0 \le k < n.$$

Finally, observe that the map 

$$(a_0, \dots, a_n) \mapsto [a_0, \dots, a_n]$$

is strictly increasing in each of the variables with an even index and strictly 
decreasing in each of the variables with an odd index. 

Computing $[a_0, \dots, a_n]$ by its definition consists in the following: start 
from the last quotient $a_n$, take the inverse $1/a_n$, add $a_{n-1}$, compute the inverse of 
the result, add $a_{n-2}$ and so on by downward induction until one adds $a_0$. 
The following iterative scheme, 

$$[a_0, a_1] = \frac{a_0 a_1 + 1}{a_1}, \qquad [a_0, a_1, a_2] = \frac{a_0 a_1 a_2 + a_2 + a_0}{a_2 a_1 + 1}, \qquad \dots$$

computes $[a_0, \dots, a_n]$ by upward induction, and reduces the computation 
of $[a_0, \dots, a_n]$ to a sum. 

Let $[a_0, \dots, a_n]$ be a continued fraction. Define $p_0, \dots, p_n$ and $q_0, \dots, q_n$ 
by 

$$\begin{cases} p_0 = a_0,\\ q_0 = 1, \end{cases} \qquad \begin{cases} p_1 = a_0 a_1 + 1,\\ q_1 = a_1, \end{cases} \tag{8.17}$$

® In this context $[a_0] = a_0$ and not, as usual, the integral part of $a_0$. We denote instead 
the floor of $x$ by $x /\!/ 1$, $/\!/$ being the integral division. 
and, for $k = 2, \dots, n$, 

$$\begin{cases} p_k = a_k p_{k-1} + p_{k-2},\\ q_k = a_k q_{k-1} + q_{k-2}. \end{cases} \tag{8.18}$$

Then $q_k > 0$ $\forall k$ and we have the following. 



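8.19 Proposition. We have 

$$\frac{p_k}{q_k} = [a_0, \dots, a_k], \qquad \forall k = 0, 1, \dots, n. \tag{8.19}$$

Moreover 

$$p_k q_{k-1} - p_{k-1} q_k = (-1)^{k-1}, \qquad k = 1, \dots, n, \tag{8.20}$$

$$p_k q_{k-2} - p_{k-2} q_k = (-1)^k a_k, \qquad k = 2, \dots, n. \tag{8.21}$$

In particular 

$$[a_0, \dots, a_k] = \frac{p_0}{q_0} + \sum_{j=1}^{k} \frac{(-1)^{j-1}}{q_j q_{j-1}}, \qquad k = 1, \dots, n. \tag{8.22}$$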



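Proof. We first prove (8.19) by induction on $k$. Of course, (8.19) holds true 
for $k = 0, 1$. By induction assume the claim true for $k$, then 

$$\begin{aligned} [a_0, a_1, \dots, a_{k+1}] &= \Big[ a_0, a_1, \dots, a_{k-1}, a_k + \frac{1}{a_{k+1}} \Big] = \frac{\big( a_k + \frac{1}{a_{k+1}} \big) p_{k-1} + p_{k-2}}{\big( a_k + \frac{1}{a_{k+1}} \big) q_{k-1} + q_{k-2}}\\ &= \frac{a_{k+1}(a_k p_{k-1} + p_{k-2}) + p_{k-1}}{a_{k+1}(a_k q_{k-1} + q_{k-2}) + q_{k-1}} = \frac{a_{k+1} p_k + p_{k-1}}{a_{k+1} q_k + q_{k-1}} = \frac{p_{k+1}}{q_{k+1}}. \end{aligned}$$

Then (8.19) follows. As 

$$\begin{aligned} p_k q_{k-1} - p_{k-1} q_k &= (a_k p_{k-1} + p_{k-2}) q_{k-1} - p_{k-1}(a_k q_{k-1} + q_{k-2})\\ &= -(p_{k-1} q_{k-2} - p_{k-2} q_{k-1}), \end{aligned}$$

by repeating the argument, we get 

$$p_k q_{k-1} - p_{k-1} q_k = (-1)^{k-1}(p_1 q_0 - p_0 q_1) = (-1)^{k-1},$$

i.e., (8.20). Also 

$$\begin{aligned} p_k q_{k-2} - p_{k-2} q_k &= (a_k p_{k-1} + p_{k-2}) q_{k-2} - p_{k-2}(a_k q_{k-1} + q_{k-2})\\ &= a_k (p_{k-1} q_{k-2} - p_{k-2} q_{k-1}) = (-1)^k a_k, \end{aligned}$$

i.e., (8.21). 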

Finally, (8.20) is written as 

$$\frac{p_k}{q_k} - \frac{p_{k-1}}{q_{k-1}} = \frac{(-1)^{k-1}}{q_k q_{k-1}}$$

for $k = 1, \dots, n$, hence 

$$\frac{p_k}{q_k} - \frac{p_0}{q_0} = \sum_{j=1}^{k} \frac{(-1)^{j-1}}{q_j q_{j-1}},$$

hence (8.22), by taking into account (8.19). □ 

  n    p_n/q_n (decimal)     p_n/q_n 
  0    0.000000000000000     0/1 
  1    1.000000000000000     1/1 
  2    0.666666666666667     2/3 
  3    0.750000000000000     3/4 
  4    0.748691099476440     143/191 
  5    0.748704663212435     289/386 
  6    0.748701973001038     721/963 
  7    0.748702422145329     1731/2312 

1731/2312 = 0.748702422145329 = [0, 1, 2, 1, 47, 2, 2, 2] 

Figure 8.7. The continued fraction expansion of 1731/2312. 
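The recurrences (8.17), (8.18) translate directly into code, and the convergents of Figure 8.7 provide a convenient check. The sketch below uses illustrative names and assumes that at least two quotients are given.

```python
from fractions import Fraction


def convergents(quotients):
    """Lists (p_k) and (q_k) from (8.17), (8.18); needs at least two quotients."""
    a0, a1 = quotients[0], quotients[1]
    p, q = [a0, a0 * a1 + 1], [1, a1]
    for a in quotients[2:]:
        p.append(a * p[-1] + p[-2])
        q.append(a * q[-1] + q[-2])
    return p, q


def evaluate(quotients):
    """[a_0, ..., a_n] computed from the last quotient upward, as in the definition."""
    value = Fraction(quotients[-1])
    for a in reversed(quotients[:-1]):
        value = a + 1 / value
    return value


example = [0, 1, 2, 1, 47, 2, 2, 2]
p_list, q_list = convergents(example)
```

For this example the last convergent is 1731/2312, and the identity (8.20) can be checked term by term on `p_list` and `q_list`.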

In the rest of this section we are interested in simple continued fractions, 
that is, in continued fractions in which all the quotients are integers. 
Clearly in this case the continued fraction is a rational number. 

8.20 Definition. A continued fraction $[a_0, \dots, a_n]$ is simple if $a_i \in \mathbb{N}$ $\forall i$ 
and $a_i \ge 1$ $\forall i \ge 1$. 

For simple continued fractions, Proposition 8.19 is particularly useful. 
We have the following. 

8.21 Proposition. Let $[a_0, \dots, a_n]$ be a simple continued fraction, and 
let $p_k/q_k := [a_0, \dots, a_k]$ be irreducible. Then 

(i) $\{p_k\}$ and $\{q_k\}$ are the numbers defined in (8.17), (8.18), 

(ii) $q_1 \ge q_0$ and $q_k > q_{k-1}$ $\forall k \ge 2$, 

(iii) $q_k \ge k$ $\forall k$, and $q_k > k$ $\forall k > 3$. 

Proof. The numbers $p_0, \dots, p_k$ and $q_0, \dots, q_k$ defined by (8.17), (8.18) are 
integers by definition, moreover they are coprime by (8.20) and $p_k/q_k = 
[a_0, \dots, a_k]$ by (8.19). Thus (i) follows. Finally we have $q_n = a_n q_{n-1} + 
q_{n-2} \ge q_{n-1} + 1$ if $n \ge 2$, hence (ii), while (iii) follows at once since 
$q_n \ge q_{n-1} + q_{n-2} > q_{n-1} + 1 \ge n$ if $n > 3$. □ 
A continued fraction does not fix its quotients as, for instance, 

$$[a_0, \dots, a_n] = [a_0, \dots, a_{n-1}, a_n - 1, 1] \qquad \text{if } a_n > 1,$$

$$[a_0, \dots, a_n] = [a_0, \dots, a_{n-1} + 1] \qquad \text{if } a_n = 1.$$

However a simple continued fraction fixes its quotients apart from the 
previous ambiguity. More precisely, we have the following. 

8.22 Proposition. Let $[b_0, \dots, b_n]$ and $[a_0, \dots, a_m]$ be two simple continued 
fractions. Suppose that $[b_0, \dots, b_n] = [a_0, \dots, a_m]$ and, for convenience, 
$m \ge n$. Then 

o either $m = n$ and $b_i = a_i$ $\forall i = 0, \dots, n$, 

o or $m = n + 1$, $b_i = a_i$ $\forall i = 0, \dots, n - 1$, $b_n = a_n + 1$ and $a_m = 1$. 

Proof. We proceed by induction on $n$. Let $n = 0$. Either $m = 0$, hence $b_0 = [b_0] = 
[a_0] = a_0$, or $m > 0$. In this case we have 

$$b_0 = [b_0] = [a_0, \dots, a_m] = a_0 + \frac{1}{[a_1, \dots, a_m]},$$

from which we infer $[a_1, \dots, a_m] = 1$, which in turn implies $m = 1$, $a_1 = [a_1, \dots, a_m] = 1$ 
and $b_0 = a_0 + 1$. 

Assuming now the claim true for simple continued fractions with $n$ quotients, let 
us prove it for a simple continued fraction with $n + 1$ quotients. Assuming $n \ge 1$, by 
(8.16), we have 

$$[b_0, [b_1, \dots, b_n]] = [a_0, [a_1, \dots, a_m]],$$

hence by the inductive assumption $b_0 = a_0$ and $[b_1, \dots, b_n] = [a_1, \dots, a_m]$. Using again 
the inductive assumption, we reach the conclusion. □ 



8.23 Corollary. Let $[a_0, \dots, a_n]$ and $[b_0, \dots, b_m]$ be two simple continued 
fractions. Suppose that $a_n, b_m \ge 2$, and that $[a_0, \dots, a_n] = [b_0, \dots, b_m]$. 
Then $n = m$ and $a_i = b_i$ $\forall i = 0, \dots, n$. 

8.24 Definition. Let $\{a_n\}$, $n = 0, 1, \dots$, be a list of real numbers such 
that $a_0 \ge 0$ and $a_i > 0$ $\forall i \ge 1$. We refer to the sequence 

$$\big\{ [a_0, \dots, a_n] \big\}, \qquad n = 0, 1, \dots,$$

as an infinite continued fraction, and we write it as $[a_0, \dots, a_n, \dots]$. For 
any integer $n$, the (finite) continued fraction $[a_0, \dots, a_n]$ is called the $n$-th 
convergent of $[a_0, \dots, a_n, \dots]$. If $[a_0, \dots, a_n] \to x \in \mathbb{R}$ as $n \to \infty$, we also 
write $x = [a_0, \dots, a_n, \dots]$. 
b. Developments as continued fractions

The following algorithm leads us to simple continued fractions. Let $x$ be a
real number. Let $a_0$ be its integral part, $a_0 := \lfloor x \rfloor$, and let $\alpha_1 := x - a_0$
be its fractional part, so that

$$x = a_0 + \alpha_1, \qquad a_0 \in \mathbb{N}, \quad a_0 \ge 0, \quad \alpha_1 \in [0, 1[.$$

If $\alpha_1 \ne 0$, we can write

$$x = a_0 + \frac{1}{1/\alpha_1}$$

and, since $1/\alpha_1 > 1$, we can reiterate the procedure,

$$a_1 := \lfloor 1/\alpha_1 \rfloor, \qquad \alpha_2 := 1/\alpha_1 - a_1,$$

$$\frac{1}{\alpha_1} = a_1 + \alpha_2, \qquad x = a_0 + \frac{1}{a_1 + \alpha_2} = [a_0, a_1 + \alpha_2],$$

to write

$$x = a_0 + \alpha_1 = a_0 + \frac{1}{a_1 + \alpha_2} = a_0 + \cfrac{1}{a_1 + \cfrac{1}{a_2 + \cdots}} = [a_0, a_1, \dots, a_k + \alpha_{k+1}] \qquad (8.23)$$

for $k = 1, 2, \dots$ as long as $\alpha_k > 0$. Thus the algorithm either ends at
the first $k = n$ at which $\alpha_{n+1} = 0$, and in this case $x = [a_0, \dots, a_n]$, or
possibly continues indefinitely.

All the $a_i$’s we find in this way are nonnegative integers and $a_i \ge 1$
$\forall i \ge 1$, thus the resulting continued fractions are simple. We refer to this
algorithm as the continued fraction expansion algorithm and to the
resulting list of continued fractions, when applying the algorithm to $x$, as
the continued fraction expansion of $x$.
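The expansion algorithm just described can be sketched in a few lines of Python. This is only an illustrative sketch: the function name `cf_expand` and the cut-off `max_terms` are assumptions introduced here, not notation from the text. With exact `Fraction` input the loop stops exactly when the fractional part vanishes; with floating-point input only the first few quotients can be trusted, because rounding errors grow at every inversion.

```python
from fractions import Fraction
from math import floor, sqrt


def cf_expand(x, max_terms=20):
    """Quotients a_0, a_1, ... of the continued fraction expansion of x >= 0.

    Hypothetical helper: stops when the fractional part is zero or after
    max_terms quotients, whichever comes first.
    """
    quotients = []
    for _ in range(max_terms):
        a = floor(x)        # a_k := integral part of the current value
        quotients.append(a)
        alpha = x - a       # alpha_{k+1} := fractional part
        if alpha == 0:      # the algorithm ends: x is rational
            break
        x = 1 / alpha       # reiterate on 1/alpha_{k+1} > 1
    return quotients


rational_expansion = cf_expand(Fraction(415, 93))
sqrt2_expansion = cf_expand(sqrt(2), max_terms=6)
```

For instance, `Fraction(415, 93)` yields a finite list of quotients, while for `sqrt(2)` the list is simply truncated after `max_terms` entries.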

Finally, we recall that, since $(a_0, \dots, a_n) \to [a_0, \dots, a_n]$ is strictly
increasing in the variables with even index and decreasing in the variables
with odd index,

$$[a_0, \dots, a_{2k}] < x = [a_0, \dots, a_{2k-1}, a_{2k} + \alpha_{2k+1}] = [a_0, \dots, a_{2k}, a_{2k+1} + \alpha_{2k+2}] < [a_0, \dots, a_{2k+1}] \qquad (8.24)$$

as far as the continued fraction expansion algorithm continues.

8.25 Euclid’s algorithm. Let $x > 0$. Euclid’s algorithm is a means to
find iteratively, if it exists, a common submultiple of $x > 0$ and 1, see
Section 1.1 and Figure 1.5, by

$$r_0 := x, \quad r_1 := 1, \qquad q_j := \lfloor r_{j-1}/r_j \rfloor, \quad r_{j-1} = q_j r_j + r_{j+1}, \qquad (8.26)$$

for $j = 1, \dots$, as long as $r_j > 0$. We refer to it as Euclid’s algorithm
starting from $(x, 1)$. The algorithm possibly stops at the first $j =: n$ for
which $r_{n+1} = 0$. In this case $x = s r_n$, $1 = t r_n$, $s, t \in \mathbb{N}$, and $x = s/t$
is rational. Moreover, the last quotient $q_n$ is larger than or equal to 2. If
conversely $x$ is rational, then Euclid’s algorithm surely stops after a finite
number of steps. In fact, if $x = p/q$, $p, q$ coprime, the remainders are $1/q$
times the corresponding remainders of Euclid’s algorithm starting with
$(p, q)$, which form a list of strictly decreasing integers, see Section 3.1.1.
Thus Euclid’s algorithm, starting with $(x, 1)$, i.e., (8.26), stops after a
finite number of steps if and only if $x$ is rational.

Start with $x \in \mathbb{R}$, then compute $\{a_k\}$ and $\{\alpha_k\}$ with

$$\alpha_0 := \frac{1}{x}, \qquad \beta_k := \frac{1}{\alpha_k}, \quad a_k := \lfloor \beta_k \rfloor, \quad \alpha_{k+1} := \beta_k - a_k,$$

for $k = 0, 1, \dots$, as far as $\alpha_k \ne 0$.

By construction

$$x = a_0 + \alpha_1 = [a_0 + \alpha_1] = [a_0, a_1 + \alpha_2] = \dots = [a_0, a_1, \dots, a_k + \alpha_{k+1}] = \dots$$

for $k = 0, 1, \dots$, as far as the algorithm continues. Possibly the algorithm stops
at the first $k =: n$ for which $\alpha_{k+1} = 0$. In this case we also have

$$x = [a_0, a_1, \dots, a_n]. \qquad (8.25)$$

Figure 8.8. The continued fraction expansion algorithm.

The development of x as a continued fraction is a rewriting of Euclid’s 
algorithm (8.26). We in fact have the following. 

8.26 Theorem. Let $\{a_j\}$ and $\{\alpha_j\}$ be the numbers produced by the con-
tinued fraction expansion algorithm of $x$, and let $\{q_j\}$, $\{r_j\}$ be respectively
the quotients and the remainders of Euclid’s algorithm (8.26) starting with
$(x, 1)$. Then

qj = aj, = aia2 . . . 

Thus the continued fraction algorithm starting with $x > 0$ produces

o either the quotients of a finite simple continued fraction $[a_0, \dots, a_n]$ such
that $x = [a_0, \dots, a_n]$, if $x$ is rational; in this case, if $x = p/q$, $p, q$ coprime,
we have $x = [a_0, \dots, a_n]$ where $n$ is the number of steps in Euclid’s
algorithm to compute g.c.d.$(p, q)$ and $a_n \ge 2$;
o or an infinite continued fraction $[a_0, \dots, a_n, \dots]$ if $x$ is irrational.
 n    p_n/q_n (decimal)    p_n/q_n        1/(q_n q_{n+1})
 1    3.000000000000000    3/1            1e+00
 2    2.666666666666667    8/3            3e-01
 3    2.750000000000000    11/4           8e-02
 5    2.718750000000000    87/32          4e-03
 7    2.718309859154930    193/71         4e-04
 9    2.718283582089552    1457/536       4e-06
11    2.718281835205993    23225/8544     1e-07
13    2.718281828735696    49171/18089    6e-09

$e = 2.718281828459045 = [2, 1, 2, 1, 1, 4, 1, 1, 6, 1, 1, 8, 1, 1, 10, \dots]$

Figure 8.9. The continued fraction expansion of Euler number $e$.



If $x$ is rational, then the continued fraction expansion $[a_0, \dots, a_n]$ of
$x$ is the only simple continued fraction with $a_n \ge 2$ which equals $x$, see
Corollary 8.23.
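For a rational $x = p/q$ the link with Euclid’s algorithm can be checked directly on integers. The sketch below is an illustration only: the name `euclid_quotients` is an assumption made here, and the function runs Euclid’s algorithm on the integer pair $(p, q)$ rather than on $(x, 1)$, which by the remark in 8.25 only rescales the remainders by $1/q$.

```python
def euclid_quotients(p, q):
    """Quotients of Euclid's algorithm starting with the integers (p, q), q > 0."""
    quotients = []
    while q != 0:
        quotients.append(p // q)   # integer quotient of the current pair
        p, q = q, p % q            # replace the pair by (divisor, remainder)
    return quotients


quotients_415_93 = euclid_quotients(415, 93)
quotients_8_3 = euclid_quotients(8, 3)
```

The returned lists are the quotients of the finite simple continued fractions of 415/93 and 8/3; in both cases the last quotient is at least 2.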

8.27 ¶. Write a detailed proof of Theorem 8.26.



c. Infinite continued fractions 

From Propositions 8.19 and 8.21 we easily get the following. 

8.28 Theorem. Let $[a_0, \dots, a_n, \dots]$ be a simple infinite continued frac-
tion, and let $p_n/q_n$ be the irreducible representation of the n-th convergent
$[a_0, \dots, a_n]$. Then $\{p_n/q_n\}$ converges to $x \in \mathbb{R}$,

$$x = [a_0, \dots, a_n, \dots],$$

where

$$x := a_0 + \sum_{j=0}^{\infty} \frac{(-1)^j}{q_j q_{j+1}}.$$

Moreover,

(i) $x - p_n/q_n$ is positive if $n$ is even and negative if $n$ is odd, so that

$$\frac{p_{2n}}{q_{2n}} < x < \frac{p_{2n+1}}{q_{2n+1}}, \qquad (8.27)$$

(ii) we have

$$\frac{1}{q_n (q_n + q_{n+1})} < \Big| x - \frac{p_n}{q_n} \Big| < \frac{1}{q_n q_{n+1}},$$

(iii) we have

$$q_n x - p_n = (-1)^n \frac{\delta_n}{q_{n+1}}$$

where $0 < \delta_n < 1$,

(iv) the differences

$$|q_n x - p_n|, \qquad \Big| x - \frac{p_n}{q_n} \Big|$$

are strictly decreasing as $n$ increases.
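The convergents can be computed with the standard recurrence $p_n = a_n p_{n-1} + p_{n-2}$, $q_n = a_n q_{n-1} + q_{n-2}$ (with $p_{-1} = 1$, $q_{-1} = 0$). The following Python sketch, whose names `convergents` and `E_QUOTIENTS` are chosen here only for illustration, evaluates it on the first quotients of $e$ listed in Figure 8.9, using exact rational arithmetic.

```python
from fractions import Fraction


def convergents(quotients):
    """Convergents p_n/q_n of [a_0, a_1, ...] via the standard recurrence."""
    p_prev, q_prev = 1, 0            # p_{-1}, q_{-1}
    p, q = quotients[0], 1           # p_0, q_0
    result = [Fraction(p, q)]
    for a in quotients[1:]:
        p, p_prev = a * p + p_prev, p
        q, q_prev = a * q + q_prev, q
        result.append(Fraction(p, q))
    return result


# First quotients of e, as listed in Figure 8.9.
E_QUOTIENTS = [2, 1, 2, 1, 1, 4, 1, 1, 6, 1, 1, 8, 1, 1, 10]
cs = convergents(E_QUOTIENTS)
```

Consecutive convergents computed this way differ by exactly $1/(q_n q_{n+1})$ in absolute value, with alternating sign, which is the content of the alternating series representation of $x$.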



Proof. From Proposition 8.21 the sequence $\{q_j q_{j+1}\}$ diverges and is strictly
monotone. Moreover (8.20) yields

$$[a_0, \dots, a_n] = \frac{p_n}{q_n} = a_0 + \sum_{j=0}^{n-1} \frac{(-1)^j}{q_j q_{j+1}}.$$

Since the series

$$\sum_{j=0}^{\infty} \frac{(-1)^j}{q_j q_{j+1}}$$

is alternating and the absolute values of the terms decrease strictly to zero,
the Leibniz test applies, hence $p_n/q_n \to x$ where

$$x = a_0 + \sum_{j=0}^{\infty} \frac{(-1)^j}{q_j q_{j+1}} \qquad (8.28)$$

and (i) and the estimate from above of $|x - p_n/q_n|$ in (ii) hold. The estimate
from below in (ii) follows also from the Leibniz test since

$$\Big| x - \frac{p_n}{q_n} \Big| > \frac{1}{q_n q_{n+1}} - \frac{1}{q_{n+1} q_{n+2}} = \frac{a_{n+2}}{q_n (a_{n+2} q_{n+1} + q_n)} \ge \frac{1}{q_n (q_{n+1} + q_n)}.$$

(iii) follows from (ii).

(iv) From (ii) we infer

$$|q_{n+1} x - p_{n+1}| < \frac{1}{q_{n+2}} = \frac{q_n}{q_n (a_{n+2} q_{n+1} + q_n)} \le \frac{q_n}{q_n (q_n + q_{n+1})} < |q_n x - p_n|,$$

thus $\{|q_n x - p_n|\}$ is strictly decreasing. As a consequence $\{|x - p_n/q_n|\}$ is
strictly decreasing, too. □



The next proposition shows that a simple infinite continued fraction is 
completely identified by its limit. 
 n    p_n/q_n (decimal)    p_n/q_n     1/(q_n q_{n+1})
 1    2.000000000000000    2/1         1e+00
 2    1.500000000000000    3/2         5e-01
 3    1.666666666666667    5/3         2e-01
 5    1.625000000000000    13/8        3e-02
 7    1.619047619047619    34/21       4e-03
 9    1.618181818181818    89/55       5e-04
11    1.618055555555556    233/144     8e-05
13    1.618037135278515    610/377     1e-05

$\frac{1 + \sqrt{5}}{2} = 1.618033988749895 = [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, \dots]$

Figure 8.10. The continued fraction expansion of the golden ratio.



8.29 Proposition. Let $[a_0, \dots, a_n, \dots]$ and $[b_0, \dots, b_n, \dots]$ be two infinite
simple continued fractions that have the same limit. Then $a_n = b_n$ $\forall n$.

Proof. For $i = 0, 1, 2, \dots$, denote by $x_i$ the real numbers

$$x_i := [a_i, \dots, a_n, \dots] = \lim_{n \to \infty} [a_i, \dots, a_n]$$

which exist by Theorem 8.28, and by $y_i$ the analogous numbers for $[b_0, \dots, b_n, \dots]$. We first observe that

$$1 \le a_i < x_i < a_i + 1 \quad \forall i \ge 1; \qquad (8.29)$$

in fact, since an infinite continued fraction never ends, $x_i = a_i + 1/x_{i+1}$ is
strictly larger than $a_i$ $\forall i$. Consequently $x_i = a_i + 1/x_{i+1} < a_i + 1/a_{i+1} \le a_i + 1$.

Then (8.29) yields

$$0 < \frac{1}{a_i + 1} < \frac{1}{x_i} < \frac{1}{a_i} \le 1 \quad \forall i \ge 1.$$

In particular, $1/x_1 < 1$. If now $[a_0, \dots, a_n, \dots] = [b_0, \dots, b_n, \dots]$, then

$$a_0 + \frac{1}{x_1} = b_0 + \frac{1}{y_1}.$$

Since $a_0, b_0 \in \mathbb{N}$ and $1/x_1, 1/y_1 \in ]0, 1[$, we then infer $a_0 = b_0$ and

$$[a_1, \dots, a_n, \dots] = [b_1, \dots, b_n, \dots].$$

We then iterate the previous argument to get $a_1 = b_1$, $a_2 = b_2$, $\dots$ □

We therefore conclude the following. 

8.30 Theorem. Let x be irrational. Then there is a unique simple con- 
tinued fraction that converges to x: the continued fraction expansion of x. 
Proof. Let $[a_0, \dots, a_n, \dots]$ be the continued fraction expansion of $x$. By
Theorem 8.28, $[a_0, \dots, a_n, \dots]$ converges to $y \in \mathbb{R}$ and

$$[a_0, \dots, a_{2n}] < y < [a_0, \dots, a_{2n+1}].$$

Since also

$$[a_0, \dots, a_{2n}] < x < [a_0, \dots, a_{2n+1}]$$

by construction, we infer $y = x$ letting $n \to \infty$. The uniqueness is stated
as Proposition 8.29. □



d. Irrationals and approximations by rationals 

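Actually the n-th convergent $p_n/q_n = [a_0, \dots, a_n]$ of the continued fraction
expansion of $x$ is the best approximation of $x$ among all fractions whose
denominator does not exceed $q_n$.

8.31 Theorem (Best rational approximations). Let $x$ be irrational
and let $p_n/q_n$ be the n-th convergent of the continued fraction development
of $x$. Then $\forall n \ge 2$, $\forall p, q \in \mathbb{N}$ coprime with $q \le q_n$ and $p/q \ne p_n/q_n$, we
have

$$|q_n x - p_n| < |q x - p|, \qquad \Big| x - \frac{p_n}{q_n} \Big| < \Big| x - \frac{p}{q} \Big|.$$

Proof. The second inequality follows from the first on dividing by $q_n \ge q$, so it suffices to prove the first. We have already proved that

$$|q_n x - p_n| < |q_{n-1} x - p_{n-1}| \quad \forall n > 0,$$

hence the claim follows by downward induction if we prove it for $p, q$ such
that $q_{n-1} < q \le q_n$ and $p/q \ne p_n/q_n$.

If $q = q_n$, we have $|p - p_n| \ge 1$ and $q_{n+1} \ge 2$, hence

$$|q_n x - p_n| < \frac{1}{q_{n+1}} \le \frac{1}{2} \le \frac{1}{2} |p_n - p| \le \frac{1}{2} |q_n x - p_n| + \frac{1}{2} |q x - p|,$$

that is, $|q_n x - p_n| < |q x - p|$.

If $q_{n-1} < q < q_n$ and $p/q \ne p_n/q_n$, we also have $p/q \ne p_{n-1}/q_{n-1}$.
Write

$$p = \alpha p_{n-1} + \beta p_n, \qquad q = \alpha q_{n-1} + \beta q_n,$$

that is

$$\alpha (p_n q_{n-1} - q_n p_{n-1}) = q p_n - p q_n, \qquad \beta (p_n q_{n-1} - q_n p_{n-1}) = p q_{n-1} - q p_{n-1},$$

i.e., since $p_n q_{n-1} - q_n p_{n-1} = (-1)^{n-1}$,

$$\alpha = (-1)^{n-1} (q p_n - p q_n), \qquad \beta = (-1)^{n-1} (p q_{n-1} - q p_{n-1});$$

in particular $\alpha$ and $\beta$ are nonzero integers. Since $q_{n-1} < q = \alpha q_{n-1} + \beta q_n < q_n$,
$\alpha$ and $\beta$ must have opposite signs, while $p_n - q_n x$ and $p_{n-1} - q_{n-1} x$
do have opposite signs. Thus $\alpha (p_{n-1} - q_{n-1} x)$ and $\beta (p_n - q_n x)$
have the same sign; hence

$$|p - q x| = |\alpha (p_{n-1} - q_{n-1} x)| + |\beta (p_n - q_n x)| > |p_{n-1} - q_{n-1} x| > |p_n - q_n x|.$$

□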
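The best approximation property can be checked by brute force on a small example. The sketch below is an illustration under stated assumptions: the helper name `best_pair` is introduced here, the search uses floating-point arithmetic (adequate for small denominators only), and $x = \sqrt{2}$, whose convergents include 17/12 and 41/29, is chosen purely as a test case.

```python
from math import sqrt


def best_pair(x, q_max):
    """(p, q) with 1 <= q <= q_max minimising |q*x - p|, by exhaustive search."""
    best = None
    for q in range(1, q_max + 1):
        p = round(q * x)            # nearest integer numerator for this q
        err = abs(q * x - p)
        if best is None or err < best[0]:
            best = (err, p, q)
    return best[1], best[2]


best_up_to_12 = best_pair(sqrt(2), 12)
best_up_to_28 = best_pair(sqrt(2), 28)
best_up_to_29 = best_pair(sqrt(2), 29)
```

As long as the denominator bound stays below the next convergent denominator 29, the minimiser remains the convergent 17/12; once 29 is allowed, it jumps to 41/29.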



8.32 Example. According to the definition of the continued fraction expansion algo-
rithm, one easily sees that $[1, 1, \dots, 1, \dots]$ is the continued fraction expansion of the
golden ratio, $\frac{1 + \sqrt{5}}{2}$. Moreover $p_n = f_{n+1}$, $q_n = f_n$, $\{f_n\}$ being the sequence of Fi-
bonacci numbers. Since in this case

$$q_0 = 1, \quad q_1 = 1, \quad q_{n+2} = q_{n+1} + q_n, \qquad p_0 = 1, \quad p_1 = 2, \quad p_{n+2} = p_{n+1} + p_n,$$

we deduce

$$\frac{f_{n+1}}{f_n} = 1 + \sum_{j=0}^{n-1} \frac{(-1)^j}{f_j f_{j+1}}.$$

We also have

$$\tfrac{1}{2}(\sqrt{5} + 1) = [1, 1, 1, 1, \dots] =: [1, \overline{1}],$$

$$\sqrt{2} = [1, 2, 2, 2, \dots] =: [1, \overline{2}],$$

$$\sqrt{5} = [2, 4, 4, 4, \dots] =: [2, \overline{4}],$$

$$\sqrt{7} = [2, 1, 1, 1, 4, 1, 1, 1, 4, \dots] =: [2, \overline{1, 1, 1, 4}].$$

Finally, as proved by Leonhard Euler (1707-1783),

$$e = [2, 1, 2, 1, 1, 4, 1, 1, 6, 1, \dots],$$

the quotients after the first following the pattern $1, 2k, 1$ for $k = 1, 2, \dots$.

Instead, no formation rule for the continued fraction of $\pi$ is known.


From Theorem 8.30 we readily infer that for the n-th convergent $p_n/q_n$ of $x$,
we have

$$\Big| x - \frac{p_n}{q_n} \Big| < \frac{1}{q_n q_{n+1}} < \frac{1}{q_n^2};$$

in particular, if $x$ is irrational, then there exist infinitely many rationals
$p/q$, $p, q$ coprime, such that

$$\Big| x - \frac{p}{q} \Big| < \frac{1}{q^2}. \qquad (8.30)$$

Moreover if $x$ is rational, $x = a/b \ne p/q$, we have

$$\Big| x - \frac{p}{q} \Big| = \frac{|aq - bp|}{bq} \ge \frac{1}{bq},$$

thus, assuming (8.30), we get $q < b$ and in turn (8.30) has a finite number
of solutions. In conclusion, we have the following.

8.33 Theorem (Dirichlet). $\alpha$ is irrational if and only if the inequality

$$|q \alpha - p| < \frac{1}{q}$$

is satisfied by infinitely many pairs of coprime integers $(p, q)$, $q > 0$.

However, the estimate (8.30) is not sharp in the following sense. It can
be easily proved that of any two consecutive convergents of $x$, one at least
satisfies the inequality

$$\Big| x - \frac{p}{q} \Big| < \frac{1}{2 q^2}. \qquad (8.31)$$

In particular, there are infinitely many convergents which satisfy (8.31).
More can be stated in this direction. For instance, it has been proved that
for any three consecutive convergents to $x$ one at least satisfies

$$\Big| x - \frac{p}{q} \Big| < \frac{1}{\sqrt{5}\, q^2}, \qquad (8.32)$$

and therefore we have the following.

8.34 Theorem (Hurwitz). Every irrational $x$ admits infinitely many ra-
tional approximations $p/q$ such that

$$\Big| x - \frac{p}{q} \Big| < \frac{1}{\sqrt{5}\, q^2}.$$

The constant $\sqrt{5}$ in the Hurwitz theorem is sharp. It can be easily
proved that if $p_n/q_n$ denotes the irreducible representation of the n-th
convergent of $x = (1 + \sqrt{5})/2$, then $q_n^2 |x - p_n/q_n| \to 1/\sqrt{5}$, and that

$$\Big| \frac{1 + \sqrt{5}}{2} - \frac{p_n}{q_n} \Big| \le \frac{1}{A q_n^2}$$

holds only for a finite number of convergents if $A > \sqrt{5}$.
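The sharpness statement can be observed numerically. The following Python sketch is only an illustration in double precision: the function name `scaled_errors` is an assumption, and the convergents of the golden ratio are generated as ratios of consecutive Fibonacci numbers, as in Example 8.32.

```python
from math import sqrt


def scaled_errors(n_max):
    """Values q_n^2 * |phi - p_n/q_n| for n = 0, ..., n_max (golden ratio phi)."""
    phi = (1 + sqrt(5)) / 2
    p, q = 1, 1                      # p_0/q_0 = 1/1
    values = []
    for _ in range(n_max + 1):
        values.append(q * q * abs(phi - p / q))
        p, q = p + q, p              # next convergent: (p + q)/p
    return values


values = scaled_errors(15)
limit = 1 / sqrt(5)
```

The scaled errors approach `limit` quickly, and among any two consecutive values at least one is below 1/2, in agreement with (8.31).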

We conclude observing that (8.31), that is satisfied by at least half of 
the convergents, characterizes the convergents. We have the following. 



8.35 Theorem. Let x be a positive real number. Assume that p, q are 
coprime integers. If (8.31) holds, then p/q is one of the convergents of the 
continued fraction development of x. 



Proof. By assumption

$$\frac{p}{q} - x = \frac{\epsilon \alpha}{2 q^2}$$

where $\epsilon = \pm 1$ and $0 < \alpha < 1$. Let $[a_0, \dots, a_n]$ be the continued fraction
expansion of $p/q$, $n$ being such that $(-1)^{n-1} = \epsilon$, and let $p_k/q_k$ be the
k-th convergent of $[a_0, \dots, a_n]$, in particular $p = p_n$, $q = q_n$. Write $x$ as

$$x = [a_0, \dots, a_n, z]$$

for a suitable $z$; according to Proposition 8.19 we have

$$x = \frac{z p_n + p_{n-1}}{z q_n + q_{n-1}},$$

thus

$$\frac{\epsilon \alpha}{2 q_n^2} = \frac{p_n}{q_n} - x = \frac{p_n q_{n-1} - p_{n-1} q_n}{q_n (z q_n + q_{n-1})} = \frac{\epsilon}{q_n (z q_n + q_{n-1})},$$

i.e.,

$$z = \frac{2}{\alpha} - \frac{q_{n-1}}{q_n},$$

that is, $z > 1$. Thus the continued fraction expansion of $z$ is a simple finite
or infinite continued fraction $[b_0, \dots, b_n, \dots]$ with $b_0 \ge 1$. We then infer
that

$$[a_0, \dots, a_n, b_0, \dots, b_n, \dots]$$

is a simple continued fraction, and

$$x = [a_0, \dots, a_n, z] = [a_0, \dots, a_n, b_0, \dots, b_n, \dots].$$

Hence $p/q = [a_0, \dots, a_n]$ is one of the convergents of the continued fraction
development of $x$. □



8.36 Periodic continued fractions. The reader may have already no-
ticed that $\sqrt{2}$, $\sqrt{5}$, $\sqrt{7}$ have periodic expansions as continued fractions. This
is a general fact. Furthermore, we have the following.

Theorem (Lagrange). A periodic continued fraction is a quadratic surd,
i.e., an irrational root of a quadratic equation with integral coefficients.



e. Order of approximation and transcendental numbers 

We say that $\alpha$ is approximable by rationals to order $n$ if there is a constant
$k = k(\alpha)$ depending on $\alpha$ for which

$$\Big| \alpha - \frac{p}{q} \Big| < \frac{k(\alpha)}{q^n}$$

has infinitely many solutions. As we have seen, every rational is approx-
imable to order 1 and not to a higher order, while every irrational is ap-
proximable to order two (see Theorem 8.34). This way we separate the
irrationals in classes that are further and further away from rationals.

8.37 Theorem (Liouville). A real algebraic number^10 of degree $n$ is not
approximable to any order greater than $n$.

^10 We recall that an algebraic number is a solution of an algebraic equation with integral
coefficients. If $x$ satisfies an equation of degree $n$, but none of lower degree, then it
is said to be of degree $n$.
Proof. Let $\xi$ be a root of

$$f(\xi) := a_0 \xi^n + a_1 \xi^{n-1} + \dots + a_n = 0$$

with $a_i \in \mathbb{Z}$, and let $p/q \ne \xi$ be an approximation of $\xi$. We can assume
that $p/q$ lies in $]\xi - 1, \xi + 1[$, and is nearer to $\xi$ than any other root of
$f(x) = 0$, so that $f(p/q) \ne 0$. Trivially there exists $M(\xi)$ such that

$$|f'(x)| < M(\xi) \quad \forall x \in [\xi - 1, \xi + 1],$$

and

$$\Big| f\Big(\frac{p}{q}\Big) \Big| = \frac{|a_0 p^n + a_1 p^{n-1} q + \dots + a_n q^n|}{q^n} \ge \frac{1}{q^n},$$

since the numerator is a positive integer. Also by Lagrange’s theorem

$$f(p/q) = f(p/q) - f(\xi) = \Big(\frac{p}{q} - \xi\Big) f'(x)$$

for some $x$ lying between $p/q$ and $\xi$. Therefore we conclude

$$\Big| \frac{p}{q} - \xi \Big| = \frac{|f(p/q)|}{|f'(x)|} > \frac{1}{M q^n}.$$

□

We can translate Liouville’s theorem into the principle that rapidly
converging sequences of rationals converge to a transcendental number,
and simple irrationals like $\sqrt{5} - 1$ or $\sqrt{2}$ cannot be rapidly approximated
by rationals: from the point of view of rational approximation the simplest
numbers are the worst.

Liouville’s theorem of course allows us to construct transcendental 
numbers easily. 



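8.38 Example. If

$$\xi = 0.110001000\ldots = 10^{-1!} + 10^{-2!} + 10^{-3!} + \dots = \sum_{k=1}^{\infty} 10^{-k!},$$

and

$$\xi_n := \sum_{k=1}^{n} 10^{-k!} = \frac{p}{10^{n!}} =: \frac{p}{q},$$

we have

$$0 < \xi - \frac{p}{q} = \xi - \xi_n = \sum_{k=n+1}^{\infty} 10^{-k!} < 2 \cdot 10^{-(n+1)!} < 2 q^{-N}$$

for $n > N$. Consequently $\xi$ is not an algebraic number of degree less than $N$. $N$ being
arbitrary, we conclude that $\xi$ is transcendental.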



8.39 ¶. Show that the number $[1, 10, 10^{2!}, 10^{3!}, 10^{4!}, \dots]$ is transcendental.
Of course, it is more difficult to decide whether a given number is
transcendental or not. For instance, only in 1873 Charles Hermite (1822-
1901) proved that $e$ is transcendental, and in 1882 Carl von Lindemann
(1852-1939) proved that $\pi$ is transcendental, too. Even nowadays only few
classes of transcendental numbers are known, for example,

$$e, \quad \pi, \quad \sin 1, \quad \log 2, \quad \log 3 / \log 2, \quad e^{\pi}, \quad 2^{\sqrt{2}}$$

are transcendental, but it is not known whether

$$2^e, \quad 2^{\pi}, \quad \pi^e$$

are transcendental or not; actually not even whether they are rational or
irrational. We only state without proofs

8.40 Theorem (Roth, 1958). The order of approximation of any alge-
braic irrational is 2. In other words, given an algebraic irrational $\alpha$ and $k > 2$,
there are only finitely many rational numbers $p/q$ solving $|\alpha - p/q| < c/q^k$.

8.41 Theorem (Lindemann-Weierstrass). Let $\alpha_1, \dots, \alpha_n$ be $n$ dis-
tinct algebraic numbers. If

$$\beta_1 e^{\alpha_1} + \dots + \beta_n e^{\alpha_n} = 0, \qquad \beta_j \in \mathbb{C},$$

then either at least one of the coefficients $\beta_j$ is transcendental or all $\beta_j$
vanish.



8.2 One-Dimensional Dynamical Systems

Roughly, a dynamical system consists of a set of possible states, together
with rules that determine the present state in terms of the past states. The
system is said to be continuous or discrete according to whether the rule
is applied at continuous or discrete times.

Let $I$ be a closed interval in $\mathbb{R}$ or $I = \mathbb{R}$, and $f : I \to I$ be a mapping
from $I$ to $I$. A typical discrete dynamical system is

$$x_0 = x, \qquad x_{n+1} = f(x_n), \quad n \ge 0.$$

The sequence $f^n(x) := f \circ f \circ \dots \circ f(x)$ of the iterates of $f$ is called
the orbit of $x$ under $f$. A point $x \in I$ such that $f(x) = x$ is called a
fixed point of $f$. According to the principle of induction, the orbit $\{x_n\}$
is uniquely determined by the initial value $x_0$; nevertheless, for nonlinear
maps, even just quadratic maps, the sequence $\{x_n\}$ may have a quite
complicated behavior, or dynamics, with effects due to nonlinearity, such as
absence of convergence, presence of oscillations, and sensitive dependence on
initial conditions, which lead to a complex distribution of the values $\{x_n\}$,
often referred to as deterministic chaos. Such dynamical systems occur in
many contexts and, in particular, when discretizing differential equations
or when studying mathematical models as, for instance, population models.



8.2.1 Discretization and models 

Given a smooth function $f : \mathbb{R} \to \mathbb{R}$, Cauchy’s problem

$$x'(t) = f(x(t)), \qquad x(0) = x_0 \qquad (8.33)$$

has a unique solution at least in a short interval $[0, T[$. Discrete methods
allow us to approximate such a solution.



a. Euler’s method 

If we assume a small $h$ as discrete approximation step, and replace in the
differential equation the derivative with the differential quotient $(x(t + h) - x(t))/h$, we find

$$x(t + h) \simeq x(t) + h f(x(t)).$$

This suggests as approximate solution

$$x^{(h)}(t) := x_k + \Big(\frac{t}{h} - k\Big)(x_{k+1} - x_k), \qquad t \in [kh, (k+1)h], \quad k = 0, 1, 2, \dots$$

where

$$x_0 := x(0), \qquad x_{k+1} = x_k + h f(x_k). \qquad (8.34)$$

In other words $x_{k+1}$ is the arrival point after time $h$ if we start from $x_k$
and move with constant speed $f(x_k)$, while $x^{(h)}(t)$ is the linear interpolate.
The discretization error

$$e(x(t), h) := \frac{x(t + h) - x(t)}{h} - f(x(t))$$

clearly is infinitesimal as $h \to 0$; in fact, we have the following.

8.42 Proposition. The sequence of functions $\{x^{(h)}(t)\}$ converges uni-
formly, as $h \to 0$, to the solution $x(t)$ of (8.33) on every bounded interval in which
$x(t)$ exists.
This easily follows from

8.43 Proposition. Let $M := \sup_{]0,T[} |f(x)|$, $L := \sup_{]0,T[} |f'(x)|$ and
$h := T/N$. Then

$$|x_N - x(T)| \le \frac{M}{2} (e^{LT} - 1) h.$$

Proof. From the equation $x'(t) = f(x(t))$ we get

$$|x(t) - x(s)| \le \Big| \int_s^t |f(x(\tau))| \, d\tau \Big| \le M |s - t|, \qquad (8.35)$$

$$x((k+1)h) = x(kh) + \int_{kh}^{(k+1)h} f(x(s)) \, ds, \qquad k = 0, 1, \dots, N - 1.$$

If $x_0 := x(0)$, $x_{k+1} := x_k + h f(x_k)$ and $\delta_k := |x_k - x(kh)|$, we then infer

$$\delta_{k+1} \le \delta_k + h |f(x(kh)) - f(x_k)| + \Big| \int_{kh}^{(k+1)h} \big( f(x(s)) - f(x(kh)) \big) \, ds \Big|$$

$$\le (1 + Lh) \delta_k + L \int_{kh}^{(k+1)h} |x(s) - x(kh)| \, ds \le (1 + Lh) \delta_k + \frac{L M h^2}{2}.$$

Iterating, we finally obtain

$$\delta_k \le (1 + Lh)^k \delta_0 + \frac{L M h^2}{2} \sum_{j=0}^{k-1} (1 + Lh)^j = \frac{L M h^2}{2} \, \frac{(1 + Lh)^k - 1}{Lh}$$

since $\delta_0 = 0$, and for $k = N$ the conclusion, since $(1 + Lh)^N \le e^{LNh} = e^{LT}$. □

Proof of Proposition 8.42. In fact for $s \in [kh, (k+1)h]$, we have

$$|x^{(h)}(s) - x(s)| \le |x^{(h)}(s) - x_k| + |x_k - x(kh)| + |x(kh) - x(s)| \le C h,$$

since $|x^{(h)}(s) - x_k| \le |x_{k+1} - x_k| \le M h$ by definition, $|x(kh) - x(s)| \le M h$
by (8.35) and $|x_k - x(kh)| \le C h$ by Proposition 8.43. □
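The scheme (8.34) and the error bound of Proposition 8.43 can be tried on a simple test problem. The following Python sketch is an illustration only: the names `euler` and `euler_error` are assumptions, and the test equation $x' = x$, $x(0) = 1$ on $[0, 1]$ (exact solution $e^t$, so one may take $M = e$ and $L = 1$) is chosen here for convenience.

```python
from math import exp


def euler(f, x0, T, N):
    """Approximate x(T) for x' = f(x), x(0) = x0, with N Euler steps h = T/N."""
    h = T / N
    x = x0
    for _ in range(N):
        x = x + h * f(x)     # x_{k+1} = x_k + h f(x_k), cf. (8.34)
    return x


def euler_error(N):
    """Error at T = 1 for the test problem x' = x, x(0) = 1 (exact value e)."""
    return exp(1.0) - euler(lambda x: x, 1.0, 1.0, N)


error_100 = euler_error(100)
error_200 = euler_error(200)
# Bound of Proposition 8.43 with M = e, L = 1, T = 1, h = 1/100.
bound_100 = exp(1.0) / 2 * (exp(1.0) - 1) * 0.01
```

Halving the step roughly halves the error, consistent with a first order method, and the observed error stays below the bound.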



The first formal description of this numerical method for approximating 
solutions of an ODE seems due to Leonhard Euler (1707-1783), while 
the proof of convergence is due to Augustin-Louis Cauchy (1789-1857). 
However, it is worth noticing that the method had already been used by 
Sir Isaac Newton (1643-1727). 
b. Runge-Kutta method

Differentiating the equation $x' = f(x)$ we find

$$x'(t) = f(x(t)),$$
$$x''(t) = f'(x(t)) f(x(t)),$$
$$x'''(t) = f''(x(t)) f^2(x(t)) + (f'(x(t)))^2 f(x(t)),$$

thus, defining

$$\Phi(x, h) := f(x) + \frac{h}{2} f'(x) f(x) + \frac{h^2}{6} \big( f''(x) f^2(x) + (f'(x))^2 f(x) \big)$$

we may set up the iteration scheme

$$x_0 := x(0), \qquad x_{k+1} = x_k + h \Phi(x_k, h).$$

Using the second order Taylor polynomial, similarly to the above one can
show that this scheme converges and yields a better approximation than
Euler’s method. And we can get even better approximations using higher
order terms.

But, from the numerical point of view, the computation of high deriva-
tives is costly since it corresponds to computing a function at many points
with higher accuracy. Therefore, while in principle we can get better and
better approximations using higher order Taylor polynomials, this is not
convenient as it leads to algorithms which are not very efficient. Thus, let
us come back to the question of a good choice of $\Phi(x, h)$ in the iteration

$$x_{k+1} = x_k + h \Phi(x_k, h).$$

We have

$$x(t + h) = x + h f(x) + \frac{h^2}{2} f'(x) f(x) + O(h^3),$$

$$\Phi(x, h) = \Phi(x, 0) + h \Phi'(x, 0) + O(h^2),$$

where the prime on $\Phi$ denotes differentiation with respect to $h$; the discretization error is then

$$e(x, h) = f(x) + \frac{h}{2} f'(x) f(x) + O(h^2) - \Phi(x, 0) - h \Phi'(x, 0) + O(h^2),$$

that is, of order $O(h^2)$, if we have

$$f(x) + \frac{h}{2} f'(x) f(x) - \Phi(x, 0) - h \Phi'(x, 0) = 0. \qquad (8.36)$$

For example the choice

$$\Phi(x, h) := A f(x) + B f(x + C h f(x)), \quad \text{with } A + B = 1, \ BC = \frac{1}{2},$$

gives a family of solutions of (8.36) which yield approximating methods of
order two.

o The modified Euler method or middle point method corresponds to
$A = 0$, $B = 1$, $C = 1/2$.
o The method of Heun corresponds to
$A = 1/2$, $B = 1/2$, $C = 1$.

Similarly, one can construct methods of approximation of order 3, 4 or
higher. The method of Runge-Kutta is a fourth order method defined by

$$\Phi(x, h) := \frac{1}{6} (m_1 + 2 m_2 + 2 m_3 + m_4),$$

$$m_1 = f(x), \qquad m_2 = f\Big(x + \frac{h}{2} m_1\Big),$$

$$m_3 = f\Big(x + \frac{h}{2} m_2\Big), \qquad m_4 = f(x + h m_3).$$
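The one-step schemes above differ only in the choice of $\Phi$, so they can share a single driver loop. The Python sketch below is an illustration under stated assumptions: the function names (`phi_euler`, `phi_heun`, `phi_rk4`, `integrate`, `final_error`) are introduced here, and the test problem $x' = x$, $x(0) = 1$ on $[0, 1]$ is chosen only because its exact solution $e^t$ is known.

```python
from math import exp


def phi_euler(f, x, h):
    return f(x)                                   # Euler: Phi(x, h) = f(x)


def phi_heun(f, x, h):
    return 0.5 * f(x) + 0.5 * f(x + h * f(x))     # A = B = 1/2, C = 1


def phi_rk4(f, x, h):
    m1 = f(x)
    m2 = f(x + 0.5 * h * m1)
    m3 = f(x + 0.5 * h * m2)
    m4 = f(x + h * m3)
    return (m1 + 2 * m2 + 2 * m3 + m4) / 6


def integrate(phi, f, x0, T, N):
    """Iterate x_{k+1} = x_k + h * Phi(x_k, h) for N steps of size h = T/N."""
    h = T / N
    x = x0
    for _ in range(N):
        x = x + h * phi(f, x, h)
    return x


def final_error(phi, N):
    """Error at T = 1 for the test problem x' = x, x(0) = 1."""
    return abs(exp(1.0) - integrate(phi, lambda x: x, 1.0, 1.0, N))
```

Comparing `final_error` for $N$ and $2N$ steps gives a rough empirical order: the error ratio is close to 1/4 for Heun’s method and close to 1/16 for the Runge-Kutta method on this test problem.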

8.44 ¶. The global error of approximation is defined by

$$E(h) = \sup_{[0,T]} |x(t) - x^{(h)}(t)|.$$

Show that in the case of Heun’s method and of the modified Euler method $E(h) = O(h^2)$, while in
the case of Runge-Kutta $E(h) = O(h^4)$.



c. Models 

Processes of the real world are often mathematically modelled in order 
to capture some of their characteristic and/or relevant aspects; from this 
point of view a model that is too close to reality may happen to be in- 
tractable and consequently useless, while a model which, though far from 
reality, identifies relevant specific aspects may be very useful: modelling is a 
kind of compromise. Industrial mathematics, biology, economics and social 
sciences are the context in which new models develop according to specific 
needs. Though fascinating, discussing even a few examples is not possible 
here, therefore we confine ourselves to presenting very briefly two classical
examples in population dynamics: the logistic model and Lotka-Volterra 
models. 

8.45 The logistic model. Let $x_0$ denote the initial size of a population
and let $\{x_n\}$ be the size after $n$ years. The rate of change is then

$$\frac{x_{n+1} - x_n}{x_n}.$$

If such a rate is constant, say $a$, the dynamics is described by

$$x_{n+1} = (1 + a) x_n;$$

the size of the population after $n$ years is $x_n = (1 + a)^n x_0$, that is, the
population increases exponentially if $a > 0$. Such a model describes the
ideal situation in which no external influence is present. More realistic
models should take into account influences of the environment. In 1845
P. F. Verhulst, starting from the assumption that the environment may
only allow the survival of a threshold population $P_0$, that we take to be 1,
formulated the hypothesis that the rate of change is proportional to $1 - x_n$.
In this case the dynamics becomes

$$x_{n+1} = (1 + a) x_n - a x_n^2. \qquad (8.37)$$

This is the logistic model; see, e.g., Section 4.2 of [GM1].

Figure 8.11. The iterates of $f(x) = x^2 + c$ starting from $x_0 = 0$ with, from the left,
$c = -1$, $c = -1.5$, $c = -2$.

8.46 ¶. Show that Euler’s method with step $h = 1$ for the equation

$$x' = a(x - x^2)$$

leads to (8.37).



8.47 Lotka-Volterra models. These are models often used to simulate
the interaction between two or more populations. In the case of two species,
because of the finiteness of resources, the rate of change, per individual,
is adversely affected by high levels of its own species (as in the logistic
model) and by the other species with which it is in competition. We have

$$\frac{x'}{x} = A(E - x) - By$$

and a similar equation for the second population, i.e.,

$$x' = A x (E - x) - B x y, \qquad y' = C y (F - y) - D x y.$$

A special case is the so-called predator-prey model

$$x' = a x - b x y, \qquad y' = -c y + d x y$$

that, in its discrete version, becomes

$$x_{k+1} = (1 + a) x_k - b x_k y_k, \qquad y_{k+1} = (1 - c) y_k + d x_k y_k.$$

Figure 8.12. The iterates of (a) $\sqrt{x}$ starting from $x_0 = 0.2$ and of (b) $1/(2x + 1)$
starting from $x_0 = 0.2$.


8.2.2 Examples of one-dimensional dynamics 

In this section we illustrate some typical features of one-dimensional dis-
crete processes. Let

$$x_{n+1} = f(x_n)$$

be such a system, $f : \mathbb{R} \to \mathbb{R}$ being a given smooth function. The orbit of
an initial point $x_0$ is given by the sequence $\{x_n\}$,

$$x_n = \underbrace{f \circ f \circ \dots \circ f}_{n \text{ times}}(x_0), \quad \text{or simply } x_n = f^n(x_0).$$

A graphic representation of the sequence $\{x_n\}$ may be given by the
points $(n, x_n)$ in the plane, possibly linearly interpolated, see Figure 8.11.

Alternatively, we plot in the $(x, y)$ plane the graph of $y = f(x)$. We
travel vertically from $(x_0, x_0)$ till $(x_0, f(x_0))$ on the graph of $f$, then hori-
zontally until we reach the diagonal, $(f(x_0), f(x_0)) =: (x_1, x_1)$, and finally,
we reiterate starting from $(x_1, x_1)$, $(x_2, x_2)$, etc., see Figure 8.12.

Our first two examples concern very simple dynamics. 
a. Expansive dynamics

Let $k > 1$ and let

$$x_{n+1} = k x_n.$$

This simple dynamics, that we have already encountered several times, has
a closed form description:

$$x_n := k^n x_0.$$

The $f^n(x_0)$ separate exponentially in time. Also, starting from slightly
different data $x_0$, $y_0$, we have $x_n - y_n = k^n (x_0 - y_0)$, i.e., the orbits of $x_0$ and
$y_0$ separate exponentially in time. Different of course is the case $0 < k < 1$:
all initial points tend to zero: they are “attracted” by zero; zero is called
a sink.

b. Contractive dynamics: fixed points 

A contraction map or simply a contraction on an interval $I \subset \mathbb{R}$ is a map
$f : I \to I$ which shrinks distances uniformly, i.e., for which there exists
a constant $L$, $0 \le L < 1$, such that $|f(x) - f(y)| \le L |x - y|$ $\forall x, y \in I$.
Of course a contraction map is a Lipschitz map, in particular contraction
maps are continuous on $I$.

8.48 Theorem (Contraction mapping theorem). Let $I$ be a closed
subset of $\mathbb{R}$, for instance a closed interval, a closed half-line, a finite union
of closed intervals, or $\mathbb{R}$, and let $f : I \to I$ be a contraction with contraction
factor $L < 1$. Then $f$ has a unique fixed point $x_0 \in I$. Moreover, the orbit
of any point $x \in I$ converges at least exponentially to $x_0$,

$$|f^n(x) - x_0| \le \frac{L^n}{1 - L} |f(x) - x|. \qquad (8.38)$$

Proof. Uniqueness. If $x, y \in I$ are two fixed points, we have

$$|x - y| = |f(x) - f(y)| \le L |x - y|,$$

hence $x = y$, since $L < 1$.

Existence. For any $x \in I$, consider its orbit $\{x_n\}$, $x_n := f^n(x)$. We then
have

$$|x_{k+1} - x_k| \le L |x_k - x_{k-1}| \le \dots \le L^k |f(x) - x|,$$

hence, for $q > p \ge 1$,

$$|x_q - x_p| \le \sum_{k=p}^{q-1} |x_{k+1} - x_k| \le \sum_{k=p}^{q-1} L^k |f(x) - x| \le |f(x) - x| \frac{L^p}{1 - L}, \qquad (8.39)$$

since $L < 1$. Since the right-hand side converges to zero as $p \to \infty$, $\{x_n\}$ is
a Cauchy sequence and therefore converges to some $x_0 \in \mathbb{R}$, which belongs to $I$ since $I$ is closed. Passing to the
limit in $x_{n+1} = f(x_n)$, we see that $x_0$ is a fixed point for $f$, thus proving
that $f$ has a fixed point, hence a unique fixed point, and that each orbit
converges to it. Finally, (8.38) follows passing to the limit as $q \to \infty$ in
(8.39). □
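The a priori estimate (8.38) can be verified numerically on a concrete contraction. The sketch below is only an illustration: the helper name `orbit` is an assumption, and the map $f = \cos$ on $I = [0, 1]$ is chosen here as an example, since $\cos$ maps $[0, 1]$ into itself and $|\cos'(x)| = |\sin x| \le \sin 1 < 1$ there.

```python
from math import cos, sin


def orbit(f, x, n):
    """Return the list [x, f(x), ..., f^n(x)]."""
    points = [x]
    for _ in range(n):
        points.append(f(points[-1]))
    return points


L = sin(1.0)                          # contraction factor of cos on [0, 1]
points = orbit(cos, 1.0, 200)
x_star = points[-1]                   # numerical approximation of the fixed point
first_step = abs(cos(1.0) - 1.0)      # |f(x) - x| for the starting point x = 1
bounds = [L ** n / (1 - L) * first_step for n in range(30)]
```

Each distance `abs(points[n] - x_star)` stays below `bounds[n]`; in practice the convergence is faster than the bound suggests, because the derivative at the fixed point is smaller than the global factor `L`.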
Figure 8.13. The iterates of $f(x) = 2.8x(1 - x)$ starting from $x = 0.8$.



c. Sinks and sources 

Let us begin by illustrating a number of phenomena associated to generic 
maps. 

8.49 Example. Let $f(x) := 2x(1 - x)$, $x \in [0, 1]$. Clearly, $f : [0, 1] \to [0, 1]$ has two
fixed points, 0 and 1/2, and the orbit of every point $x \in ]0, 1[$ converges to 1/2;
in fact, $f(x) \in ]0, 1/2]$, the sequence $\{f^n(x)\}_{n \ge 1}$ is increasing, and passing to the limit in $x_{n+1} = f(x_n)$, necessarily,
$f^n(x) \to 1/2$.

Trivially we have

8.50 Proposition. Let $f : I \to I$ be a continuous map, and let $\{f^k(x)\}$
be the orbit of $x \in I$. Then

(i) if $f^k(x) \to p$ as $k \to +\infty$, then $p$ is a fixed point, $f(p) = p$,

(ii) if $f^n(x) = p$ for some $n$, and $p$ is a fixed point, then $f^k(x) = p$ $\forall k \ge n$.
Example 8.49 then suggests 

8.51 Definition. Let $f : I \to I$ be a continuous map, and let $p$ be a fixed
point of $f$. We say that

(i) $p$ is stable if $\forall \epsilon > 0$ there is $\delta > 0$ such that $|f^n(x) - p| < \epsilon$ $\forall n$ if
$|x - p| < \delta$,

(ii) $p$ is a sink or an attracting point if there exists $\delta > 0$ such that
$f^n(x) \to p$ for all $x$ with $|x - p| < \delta$. If $p$ is a sink, the basin of
attraction of $p$ is the subset of points of $I$ whose orbits converge to
$p$,

(iii) $p$ is unstable if $p$ is not stable, i.e., if for some $\epsilon_0 > 0$ there exists a
sequence $\{x_k\}$ such that $x_k \to p$, and for each $k$ an integer $n_k$ such
that $|f^{n_k}(x_k) - p| > \epsilon_0$,

(iv) $p$ is a source or a repelling point if for some $\epsilon_0 > 0$ and $\delta > 0$ and for each $x$
such that $0 < |x - p| < \delta$ there is $n_x$ such that $|f^{n_x}(x) - p| > \epsilon_0$.

If $p$ is stable and a sink, then $p$ is said to be an asymptotically stable fixed
point.

Of course, by definition sources are unstable fixed points. Trivially the fixed
point of a contraction on $I$ is a stable fixed point and a sink, and its basin
of attraction is the whole $I$.

In Example 8.49, 0 is a source, 1/2 is a sink, and the basin of attraction
of 1/2 is $]0, 1[$. In general, we have

8.52 Theorem. Let p be a fixed point for a continuous map f : I I, 
and assume that f is differentiable at p. 

(i) If \ f'{p)\ < 1, then p is a stable fixed point and a sink, that is, p is 
asymptotically stable. 

(ii) If \f'{p)\ > 1, then p is a source, in particular p is an unstable fixed 
point. 

Proof. (i) Since $\lim_{x \to p} \frac{|f(x) - f(p)|}{|x - p|} = |f'(p)| < 1$, there is $L < 1$ and $\delta > 0$ such that

$$|f(x) - p| = |f(x) - f(p)| \le L|x - p| \le |x - p|$$

if $x \in I$ and $|x - p| < \delta$. In particular, if $|x - p| < \delta$, then $|f^n(x) - p| < \delta$ $\forall n$, hence $p$ is stable. Moreover

$$|f^{n+1}(x) - p| \le L|f^n(x) - p| \quad \forall n,$$

and, by iteration, we therefore conclude that

$$|f^n(x) - p| \le L^n|x - p|,$$

hence $f^n(x)$ converges exponentially to $p$ as $n \to \infty$.

(ii) Similarly to (i), there is $L > 1$ and $\delta > 0$ such that $|f(x) - p| \ge L|x - p|$ if $|x - p| < \delta$. If $p$ were not a source, for any $\epsilon > 0$ we could then find $x \ne p$ arbitrarily close to $p$ such that $|f^n(x) - p| < \epsilon$ $\forall n$. Choosing $\epsilon = \delta$ we would find $x \ne p$ such that $|f^n(x) - p| < \delta$ $\forall n$ and, setting $\sigma_n := |f^n(x) - p|$, $\sigma_n < \delta$ $\forall n$. But on the other hand by assumption $\sigma_{n+1} \ge L\sigma_n$ $\forall n$, i.e., $\sigma_n \to \infty$: a contradiction. □

8.53 Remark. We notice that, if $p$ is a fixed point of $f$, $f$ is twice differentiable at $p$ and $f'(p) = 0$, then the orbit $\{f^n(x)\}$ converges to $p$ rapidly. In fact, in this case the second order Taylor formula yields $\sigma_{n+1} \le M\sigma_n^2$, $\sigma_n := |f^n(x) - p|$, see (8.15).
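The criterion of Theorem 8.52 is easy to try out numerically. The following Python sketch applies it to the fixed points $0$ and $(a-1)/a$ of the logistic map $x \mapsto ax(1-x)$, taken here only as a convenient test case; the function names are ours, and the borderline case $|f'(p)| = 1$, on which the theorem is silent, is reported as undecided.

```python
def logistic_prime(a, x):
    """Derivative of f_a(x) = a x (1 - x), i.e. a (1 - 2x)."""
    return a * (1.0 - 2.0 * x)


def fixed_points(a):
    """The two real fixed points 0 and (a - 1)/a of x -> a x (1 - x), a != 0."""
    return [0.0, (a - 1.0) / a]


def classify(a, p, tol=1e-12):
    """Classify a fixed point p of f_a by the size of |f_a'(p)| (Theorem 8.52).

    The tolerance `tol` is an illustrative choice guarding the case |f'(p)| = 1.
    """
    m = abs(logistic_prime(a, p))
    if m < 1.0 - tol:
        return "sink"
    if m > 1.0 + tol:
        return "source"
    return "undecided"


if __name__ == "__main__":
    for a in (0.5, 2.0, 2.8, 3.3):
        print(a, [(round(p, 4), classify(a, p)) for p in fixed_points(a)])
```

Note that for $a < 1$ the second fixed point $(a-1)/a$ is negative, hence outside $[0,1]$.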



d. Periodic orbits 

The dynamics of the logistic map $f_a(x) = ax(1-x)$, $x \in [0,1]$, changes significantly with the parameter $a$. Observe that the map $g(x) := ax(1-x)$, $x \in \mathbb{R}$, has two fixed points in $\mathbb{R}$, $0$ and $(a-1)/a$.

For instance, if $0 < a < 1$, then $|f_a'(x)| \le a < 1$ and $f_a$ is a contraction with the unique fixed point $x = 0$. For $a = 1$ the map $f_1$ still has $0$ as a unique fixed point, and it is easy to check that $0$ is a sink. For $1 < a < 3$, $f_a$ has two fixed points: $0$, which is a source, and $(a-1)/a$, which is a sink with $]0,1[$ as basin of attraction.

Figure 8.14. The iterates of $f(x) := 3.3x(1-x)$.



For $a = 3.3$, $f_a$ has two fixed points, $0$ and $23/33$, and both are sources. Therefore the orbits cannot converge. Numerical experiments show that the values of the iterates oscillate between two values of the order of 0.4794 and 0.8236, see Figure 8.14, and moreover $f(0.4794) = 0.8236$ and $f(0.8236) = 0.4794$, up to rounding. This suggests that the two alternating values are fixed points of $f^2$. Actually, one easily proves that, besides the two fixed points of $f$, the map $g(x) := f \circ f(x)$ has exactly two further fixed points $p_1, p_2$, see Figure 8.15, with $p_1 := 0.4794\ldots$ and $p_2 := 0.8236\ldots$, and that both are sinks for $g$. The same holds if $3 < a < 1 + \sqrt{6} = 3.44949\ldots$.
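The numerical experiment just described can be repeated with a few lines of Python. This is a minimal sketch: the starting point 0.3 and the number of discarded iterates are arbitrary choices of ours.

```python
def f(x, a=3.3):
    """Logistic map f(x) = a x (1 - x)."""
    return a * x * (1.0 - x)


def orbit_tail(x0, a=3.3, transient=1000, keep=4):
    """Discard `transient` iterates, then return the next `keep` values of the orbit."""
    x = x0
    for _ in range(transient):
        x = f(x, a)
    tail = []
    for _ in range(keep):
        tail.append(x)
        x = f(x, a)
    return tail


tail = orbit_tail(0.3)
p_lo, p_hi = min(tail), max(tail)   # the two values between which the orbit alternates

if __name__ == "__main__":
    print(tail)
    print(p_lo, p_hi)
```

The values `p_lo` and `p_hi` approximate $p_1$ and $p_2$; each is mapped to the other by $f$, so both are fixed points of $f \circ f$.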



8.54 Definition. Let $f : I \to I$ be a continuous map. A $k$-periodic point is a fixed point of $f^k$, $k \ge 1$, that is not a fixed point of $f^h$ for any $h < k$. A $k$-periodic orbit is the orbit $\{f^n(p)\}$ of a $k$-periodic point $p$.

Of course, a $k$-periodic orbit consists of $k$ distinct points that are $k$-periodic points.

8.55 Example. The origin is the unique fixed point of $f(x) = -x$, $x \in [-1,1]$; any other point is a 2-periodic point, since $f \circ f(x) = -(-x) = x$, and the 2-periodic orbit of $x$ is the sequence $\{(-1)^n x\}$.

The stability of $k$-periodic orbits can now be discussed exactly in the same terms in which we discussed the stability of fixed points.

8.56 Definition. Let $f : I \to I$ be a continuous map on a closed interval $I$ and let $p$ be a $k$-periodic point of $f$. We say that the $k$-periodic orbit of $p$ is a $k$-periodic sink or a $k$-periodic attractor if $p$ is a sink for the $k$-th iterate $f^k$ of $f$.

We say that the $k$-periodic orbit of $p$ is a periodic source if $p$ is a source for $f^k$.

Finally we say that the $k$-periodic orbit of a $k$-periodic point $p$ is stable, asymptotically stable, unstable if respectively $p$ is a stable, asymptotically stable, unstable fixed point of $f^k$.

Figure 8.15. On the left $f(x) = 3.3x(1-x)$ and on the right $f^2(x)$.



8.57 Let $f : I \to I$. An orbit $\{f^n(x)\}$ is said to be asymptotically $k$-periodic if $|f^n(x) - f^n(p)| \to 0$ as $n \to \infty$ for some $k$-periodic point $p$. Show that $p$ is a $k$-periodic sink if and only if there is $\delta > 0$ such that $\{f^n(x)\}$ is asymptotically periodic for all $x$ with $|x - p| < \delta$.

Let $p$ be a $k$-periodic point of $f$, and assume that $f$ is differentiable at each point of the orbit of $p$. Denote by $p_0, \ldots, p_{k-1}$, $p_0 := p$, the $k$ distinct values of the orbit $\{f^n(p)\}$; since by the chain rule we have

$$D(f^k)(p) = f'(f^{k-1}(p))\, f'(f^{k-2}(p)) \cdots f'(f(p))\, f'(p) = f'(p_{k-1})\, f'(p_{k-2}) \cdots f'(p_0),$$

from Theorem 8.48 we infer at once the following.

8.58 Theorem. Let $f : I \to I$ be differentiable. If $p_0, \ldots, p_{k-1}$, $p_0 := p$, are the $k$ values of a $k$-periodic orbit then

(i) if $|f'(p_0) f'(p_1) \cdots f'(p_{k-1})| < 1$, the orbit is asymptotically stable, that is, the orbit is a sink and is stable,

(ii) if $|f'(p_0) f'(p_1) \cdots f'(p_{k-1})| > 1$, the orbit of $p$ is a source, hence unstable.



8.59 Example. In the case $f(x) = 3.3x(1-x)$, $x \in [0,1]$, neither of the two fixed points 0.4794 and 0.8236 of $f^2$ is a fixed point of $f$. The corresponding 2-orbit is a periodic sink, since $|f'(0.4794) f'(0.8236)| < 1$, see Figures 8.14 and 8.15.
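The product of derivatives in Theorem 8.58 can be computed explicitly for the logistic map. The sketch below is ours and rests on one elementary computation not carried out in the text: dividing $f_a(f_a(x)) - x$ by the factors corresponding to the fixed points $0$ and $(a-1)/a$ leaves the quadratic $a^2x^2 - a(a+1)x + (a+1) = 0$, whose roots are the 2-periodic points when $a > 3$.

```python
import math


def two_cycle(a):
    """2-periodic points of x -> a x (1 - x) for a > 3.

    Roots of a^2 x^2 - a (a + 1) x + (a + 1) = 0.
    """
    root = math.sqrt((a - 3.0) * (a + 1.0))
    return (a + 1.0 - root) / (2.0 * a), (a + 1.0 + root) / (2.0 * a)


def fprime(a, x):
    """Derivative of the logistic map, a (1 - 2x)."""
    return a * (1.0 - 2.0 * x)


def multiplier(a):
    """The product f'(p_1) f'(p_2) appearing in Theorem 8.58 for k = 2."""
    p1, p2 = two_cycle(a)
    return fprime(a, p1) * fprime(a, p2)


if __name__ == "__main__":
    print(two_cycle(3.3), multiplier(3.3))
```

For $a = 3.3$ the product has absolute value less than 1, in agreement with Example 8.59, while for $a = 3.5$ its absolute value exceeds 1.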



e. Period-doubling cascade: transition to chaos

If we increase the parameter $a$ in the logistic map, the dynamics becomes more and more complex. The so-called bifurcation diagram of $f_a$ is plotted in Figure 8.16. The diagram has been obtained by taking the parameter $a$ as abscissa, computing the iterates $f_a^k(x_0)$ and plotting on the line $x = a$ the values $f_a^k(x_0)$ of the orbit of $x_0$ for $200 < k \le 400$. The values of $x_0$ and of $f_a^k(x_0)$ for $1 \le k \le 200$ have been neglected in order to eliminate the dependence on the initial data as much as possible. The figure suggests that for $a = 3.45$ there is a 4-orbit which is a sink: this can in fact be proved along the same lines as Theorem 8.58. As $a$ increases, one gets an 8-orbit, a 16-orbit and actually an entire sequence of $2^n$-orbits, $n = 1, 2, \ldots$, which are sinks, until $a$ reaches a limiting value $a_\infty := 3.5699456\ldots$. Such a sequence is called a period-doubling cascade and is one of the routes to chaos since in fact, for $a > a_\infty$ orbits appear to randomly fill out the entire interval or a subinterval: they are quite complicated, hard to describe and quite irregular. But before that chaos, there is some regularity, which in fact is universal. In fact, set $f_a(x) := a f(x)$ for any continuous map $f : [0,1] \to [0,1]$ with a unique maximum point and $f(0) = f(1) = 0$, and denote by $a_n$ the value at which the $n$-th bifurcation occurs. Then it has been proved that there is a constant $\delta$, called the Feigenbaum constant, such that

$$\lim_{n \to \infty} \frac{a_n - a_{n-1}}{a_{n+1} - a_n} = \delta = 4.66920161\ldots.$$

This is an example of order in the transition to chaos.

Figure 8.16. The iterates of the logistic map $f(x) := ax(1-x)$.
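The recipe used for the bifurcation diagram (discard the first 200 iterates, record the next 200) can be sketched as follows. The starting point 0.3 and the rounding to 6 digits, used to merge iterates that agree up to numerical noise, are illustrative assumptions of ours.

```python
def logistic_attractor(a, x0=0.3, transient=200, keep=200, digits=6):
    """Distinct values (rounded to `digits`) visited by the orbit after a transient.

    These are the ordinates one would plot above the abscissa a.
    """
    x = x0
    for _ in range(transient):
        x = a * x * (1.0 - x)
    seen = set()
    for _ in range(keep):
        x = a * x * (1.0 - x)
        seen.add(round(x, digits))
    return sorted(seen)


if __name__ == "__main__":
    for a in (2.8, 3.2, 3.5, 3.9):
        print(a, len(logistic_attractor(a)))
```

Close to a bifurcation value the convergence to the periodic sink is slow, so there the count of distinct values may overestimate the period unless the transient is lengthened.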






Figure 8.17. The map $f(x) := 4x(1-x)$ and the graphs of $f^2(x)$, $f^3(x)$.

Figure 8.18. The intermittency phenomenon due to a thin channel.

8.60 Example. The map $f(x) = 4x(1-x)$, $x \in [0,1]$, has two fixed points, $x = 0, 3/4$, but no sink. The map $f^2$ has four fixed points $x = 0, p_1, 3/4, p_2$, two of which are the fixed points of $f$ and the other two are 2-periodic points, $f(p_1) = p_2$ and $f(p_2) = p_1$. The map $f^3$ has eight fixed points: $0$ and $3/4$, while the 2-periodic points are not fixed points for $f^3$; the remaining six points form two orbits of period 3, see Figure 8.17. For more complicated orbits, compare Proposition 8.75.



f. The intermittency phenomenon 

Consider the map

$$f(x) := \begin{cases} a x & \text{if } x \in [0, 1/a], \\ \dfrac{a}{a-1}(1 - x) & \text{if } x \in\, ]1/a, 1]. \end{cases}$$

For $a > 1$, one sees that $f$ has no stable fixed point or orbit, and the orbits become quite complex; for a long period the process is regular, then the iterates oscillate around the unstable fixed point $x^* := a/(2a-1)$ for some time, after which they go away and the process restarts again more or less similarly, see Figure 8.19.

A similar phenomenon can be observed for the maps $g_a(x) := g(x - a)$ in Figures 8.19 and 8.20.

For $a > a_0$, the iterates $x_n$ quickly get close to a fixed point, while for $a < a_0$ and $a \simeq a_0$ the iterates remain for some time in a “channel,” then they jump outside the channel, after which they move back in for a long interval which in general depends on the point at which the iterate enters the channel.

For the logistic maps $f_a(x) = ax(1-x)$ and for the maps above, as $a$ varies we experience, as we have seen, a transition from a regular regime to a “chaotic” regime: they may be regarded as two examples of transition to chaos.

Figure 8.19. The iterates of (a) $g(x - 0.6)$ and of (b) $g(x - 0.5)$, where $g(x) = e^x - 1/2$ until it reaches 1 and then decays linearly.



g. Ergodic dynamics 

Let us consider the dynamical system

$$x_{n+1} = x_n + \omega \mod 1 \qquad (8.40)$$

associated to the map $\varphi_\omega : [0,1] \to [0,1]$, $\varphi_\omega(x) = x + \omega \pmod 1$, that maps $x$ into the fractional part of $x + \omega$.

Clearly, it can be regarded as a dynamical system on the circle $S^1$. In fact, if we identify $\mathbb{R}/\mathbb{Z}$ with $S^1$ by means of the map $t \mapsto \exp(i 2\pi t)$ and set $z_n := e^{i 2\pi x_n}$, we have

$$z_{n+1} = e^{i 2\pi\omega} z_n.$$

In other words, $z_{n+1}$ is the point $z_n$ rotated counterclockwise by the angle $2\pi\omega$.

If $\omega$ is rational, $\omega = p/q$, $p, q$ coprime, the orbits of the system are all $q$-periodic and consist of a finite number of points $x_j := x + j\frac{p}{q} \mod 1$.
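For rational $\omega$ the periodicity of the orbits of (8.40) can be verified in exact arithmetic. The following sketch uses Python's `fractions.Fraction`; the helper names are ours.

```python
from fractions import Fraction


def rotate(x, omega):
    """One step of x -> x + omega (mod 1), exact for rational inputs."""
    return (x + omega) % 1


def orbit(x0, omega, n):
    """The points x0, phi(x0), ..., phi^n(x0)."""
    xs = [x0]
    for _ in range(n):
        xs.append(rotate(xs[-1], omega))
    return xs


def period(x0, omega, max_steps=10_000):
    """Smallest k >= 1 with phi^k(x0) = x0, or None if none is found."""
    x = rotate(x0, omega)
    for k in range(1, max_steps + 1):
        if x == x0:
            return k
        x = rotate(x, omega)
    return None


if __name__ == "__main__":
    print(period(Fraction(1, 7), Fraction(3, 8)))
    print(sorted(set(orbit(Fraction(0), Fraction(2, 5), 20))))
```

With $\omega = 3/8$ every orbit has period 8, whatever the starting point.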





Figure 8.20. Here $g(x) = e^x - 1/2$ until it reaches 1 and then decays linearly. From the left, the iterates of $g(x - 0.6)$, $g(x - 0.5)$, and of $g(x - 0.45)$.



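8.61 Theorem (Jacobi). If $\omega$ is irrational, all orbits of $\varphi_\omega$ are dense in $[0,1]$.

Proof. Let $\omega$ be irrational and let $x \in [0,1]$. First we notice that the points $\{\varphi_\omega^n(x)\}$ are distinct. In fact, if $\varphi_\omega^n(x) = \varphi_\omega^m(x)$, then $(n - m)\omega \in \mathbb{Z}$, consequently $n = m$, $\omega$ being irrational. The infinitely many distinct points of an orbit admit a convergent subsequence, by the Bolzano-Weierstrass theorem, hence for any $\epsilon > 0$ we can find two distinct elements $\varphi_\omega^n(x)$ and $\varphi_\omega^m(x)$ such that $|\varphi_\omega^n(x) - \varphi_\omega^m(x)| < \epsilon$. Since $\varphi_\omega$ preserves the distances on the circle, we deduce

$$|\varphi_\omega^p(x) - x| < \epsilon \quad \text{for } p = |n - m|,$$

consequently

$$0 < |\varphi_\omega^{(k+1)p}(x) - \varphi_\omega^{kp}(x)| = |\varphi_\omega^p(x) - x| < \epsilon.$$

We conclude that the sequence $\varphi_\omega^p(x), \varphi_\omega^{2p}(x), \ldots$ divides the circle into arcs of length uniformly bounded from below and not larger than $\epsilon$: this shows that the orbit of $x$ is dense. □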

The dynamics of (8.40), though quite complex, has relevant regularity properties. The time mean along the orbit of a continuous function $f : [0,1] \to \mathbb{R}$ or $f : [0,1] \to \mathbb{C}$, also called an observable, is defined as the limit

$$f^*(x) := \lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} f(\varphi_\omega^n(x))$$

if it exists, while its mean is called the phase mean

$$\bar f := \int_0^1 f(x)\,dx.$$

8.62 Theorem (Ergodic theorem). Let $\omega$ be irrational. Then the limit $f^*(x)$ exists for all $x$ and $f^*(x) = \bar f$.

In other words, the time mean of an observable along the orbit is independent of the orbit itself, equivalently of the initial value, and equals the phase mean. (Actually one refers to a dynamics associated with a map of $[0,1]$ into itself as an ergodic dynamics if $f^*(x) = \bar f$ for almost every point $x \in [0,1]$ in the sense of Lebesgue.)

Theorem 8.62 can be obtained as a consequence of the theorem of Hermann Weyl (1885-1955) on the uniform equidistribution of the fractional parts of $\omega, 2\omega, 3\omega, \ldots, n\omega$ for $n$ large, if $\omega$ is irrational.

8.63 Theorem (Weyl). Let $\omega$ be an irrational.

(i) For every continuous and 1-periodic function $f$, we have

$$\frac{1}{N} \sum_{n=1}^{N} f(n\omega) \to \int_0^1 f(t)\,dt \quad \text{as } N \to \infty. \qquad (8.41)$$

(ii) If $0 \le a < b \le 1$, then

$$\frac{\#\{n \mid 1 \le n \le N,\ n\omega \bmod 1 \in [a,b]\}}{N} \to b - a \quad \text{as } N \to \infty,$$

where, we recall, $\#A$ denotes the number of elements of $A$.
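Both statements of Weyl's theorem can be observed numerically. In the sketch below $\sqrt{2}$, in double precision, stands in for an irrational $\omega$; this choice, the test function and the sample size are assumptions of ours, and a floating-point experiment is of course only an illustration, not a proof.

```python
import math


def time_mean(f, omega, N):
    """(1/N) * sum_{n=1}^{N} f(n * omega mod 1), the left-hand side of (8.41)."""
    return sum(f((n * omega) % 1.0) for n in range(1, N + 1)) / N


def visit_fraction(a, b, omega, N):
    """Fraction of n in 1..N whose fractional part of n * omega lies in [a, b]."""
    hits = sum(1 for n in range(1, N + 1) if a <= (n * omega) % 1.0 <= b)
    return hits / N


omega = math.sqrt(2.0)   # double-precision stand-in for an irrational number

if __name__ == "__main__":
    g = lambda t: math.cos(2.0 * math.pi * t) ** 2   # phase mean 1/2
    print(time_mean(g, omega, 20000))
    print(visit_fraction(0.2, 0.5, omega, 20000))    # should be close to 0.3
```

Replacing `omega` by a rational number such as 0.25 makes the visit fraction depend on how many of the finitely many orbit points fall in $[a,b]$, and the equidistribution fails.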

The proof of the Weyl theorem we present here uses the following density result that we do not prove.

8.64 Theorem. Trigonometric polynomials in $[0,1]$ are dense in the class of continuous and 1-periodic functions, i.e., given a continuous function $f : [0,1] \to \mathbb{R}$ with $f(0) = f(1)$, and $\epsilon > 0$, there is a 1-periodic trigonometric polynomial $P$ such that

$$|f(t) - P(t)| < \epsilon \quad \forall t \in [0,1].$$

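Proof of Theorem 8.63. (i) For increasingly complex $f$ we shall prove that

$$G_N(f) := \frac{1}{N} \sum_{n=1}^{N} f(n\omega) - \int_0^1 f(t)\,dt \to 0 \quad \text{as } N \to \infty.$$

(a) If $f(t) = 1$, then clearly $G_N(f) = \frac{1}{N} \sum_{n=1}^{N} 1 - \int_0^1 dt = 0$.

(b) Suppose $f(t) = \exp(i 2\pi k t)$, $k \in \mathbb{Z}$, $k \ne 0$, so that $\int_0^1 f(t)\,dt = 0$. Since $\omega$ is irrational, $\exp(i 2\pi k\omega) \ne 1$, hence

$$|G_N(f)| = \frac{1}{N} \Big| \sum_{n=1}^{N} e^{i 2\pi n k \omega} \Big| = \frac{1}{N} \Big| e^{i 2\pi k \omega} \sum_{n=0}^{N-1} e^{i 2\pi n k \omega} \Big| = \frac{1}{N} \Big| e^{i 2\pi k \omega}\, \frac{1 - e^{i 2\pi N k \omega}}{1 - e^{i 2\pi k \omega}} \Big| \le \frac{1}{N}\, \frac{2}{|1 - e^{i 2\pi k \omega}|} \to 0 \quad \text{as } N \to \infty.$$

(c) Suppose now that $f$ is a trigonometric polynomial of period 1, $P(t) = \sum_{k=-p}^{p} c_k e^{i 2\pi k t}$. We have $G_N(P) = \sum_{k=-p}^{p} c_k\, G_N(\exp(i 2\pi k t))$, hence (a) and (b) yield $G_N(P) \to 0$ as $N \to \infty$.

(d) For a continuous and 1-periodic function $f : \mathbb{R} \to \mathbb{C}$, given $\epsilon > 0$, by the density theorem, Theorem 8.64, we find a trigonometric polynomial $P(t)$ such that $|f(t) - P(t)| < \epsilon$ $\forall t$. Thus

$$|G_N(f) - G_N(P)| \le 2\epsilon \quad \forall N.$$

According to (c), we can find $N_0$ such that $|G_N(P)| < \epsilon$ for all $N \ge N_0$. Therefore, we conclude for $N \ge N_0$,

$$|G_N(f)| \le |G_N(P)| + |G_N(f) - G_N(P)| < 3\epsilon,$$

i.e., $G_N(f) \to 0$ as $N \to \infty$.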

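(ii) Given $\epsilon > 0$, let $f_-$ and $f_+$ be two continuous and 1-periodic functions such that

$$f_-(t) \le 1 \le f_+(t) \quad \forall t \in [a,b],$$

$$f_-(t) = 0, \quad f_+(t) \ge 0 \quad \forall t \notin [a,b],$$

$$(b - a) - \epsilon \le \int_0^1 f_-\,dt \le \int_0^1 f_+\,dt \le (b - a) + \epsilon.$$

Trivially

$$\sum_{n=1}^{N} f_-(n\omega) \le \#\{n \mid 1 \le n \le N,\ n\omega \bmod 1 \in [a,b]\} \le \sum_{n=1}^{N} f_+(n\omega).$$

On the other hand, by (i), for $N \ge N_0 = N_0(\epsilon)$ we have $|G_N(f_+)|, |G_N(f_-)| < \epsilon$, hence

$$\int_0^1 f_-\,dt - \epsilon \le \frac{\#\{n \mid 1 \le n \le N,\ n\omega \bmod 1 \in [a,b]\}}{N} \le \int_0^1 f_+\,dt + \epsilon$$

and therefore

$$(b - a) - 2\epsilon \le \frac{\#\{n \mid 1 \le n \le N,\ n\omega \bmod 1 \in [a,b]\}}{N} \le (b - a) + 2\epsilon \quad \text{for } N \ge N_0. \qquad □$$

Figure 8.21. Aleksandr Lyapunov (1857-1918).

We notice that the conclusions of Theorem 8.62 would be false if $\omega$ were rational. Therefore Theorem 8.62 characterizes the irrationals as the reals $\omega$ for which either (8.41) holds or the fractional parts of $\{n\omega\}$ are equidistributed.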



8.2.3 Chaotic dynamics 



Let us discuss now some of the characteristic features of chaos.

Figure 8.22. Frontispieces of two stimulating books.



a. Sensitive dependence on initial conditions and the Lyapunov 
exponent 

8.65 Definition. Let $f : I \to I$ be a map defined on a closed interval $I \subset \mathbb{R}$. We say that a point $x_0 \in I$ has sensitive dependence on initial conditions or is a sensitive point if there exist $\epsilon_0 > 0$ and sequences $\{x_k\}$ and $\{n_k\}$ such that $x_k \to x_0$ and $|f^{n_k}(x_k) - f^{n_k}(x_0)| \ge \epsilon_0$.

It is not difficult to show that sources of any power of $f$ are sensitive points.

In the case in which points near $x_0$ become separated by the action of the map $f$, a measure of such a separation is provided by the Lyapunov exponent. If the separation is exactly exponential, that is $|f^n(x) - f^n(x_0)| = q^n|x - x_0|$, the Lyapunov number is $q$. In the general case, the Lyapunov number at $x_0$ is defined by

$$L(x_0) := \lim_{n \to \infty} \limsup_{x \to x_0} \left( \frac{|f^n(x) - f^n(x_0)|}{|x - x_0|} \right)^{1/n}$$

and the Lyapunov exponent,

$$\lambda(x_0) := \log L(x_0),$$

is a measure of the (exponential) separation of the orbits starting close to $x_0$, provided, of course, the limit as $n \to \infty$ exists.

If $f$ is differentiable near $x_0$, the Lagrange mean value theorem yields

$$L(x_0) = \lim_{n \to \infty} |(f^n)'(x_0)|^{1/n}$$

or, since $(f^n)'(x_0) = \prod_{i=0}^{n-1} f'(x_i)$, $x_i := f^i(x_0)$,

$$L(x_0) = \lim_{n \to \infty} \Big( \prod_{i=0}^{n-1} |f'(x_i)| \Big)^{1/n}$$

and consequently (a shift of the summation index by one does not affect the limit)

$$\lambda(x_0) = \log L(x_0) = \lim_{n \to \infty} \frac{1}{n} \sum_{i=1}^{n} \log|f'(x_i)|.$$

Figure 8.23. Bernoulli’s shift.

In particular, the Lyapunov exponent $\lambda(x_1)$ of a fixed point $x_1$ of a smooth function $f$ is $\log|f'(x_1)|$, while the Lyapunov exponent of a $k$-periodic orbit, at each of the values of the orbit, $x_1, x_2, \ldots, x_k$, $x_{i+1} = f(x_i)$, $x_i \ne x_j$ $\forall i \ne j$, $x_{k+1} = x_1$, is

$$\lambda(x_1) = \lambda(x_2) = \cdots = \lambda(x_k) = \frac{1}{k} \sum_{i=1}^{k} \log|f'(x_i)|.$$
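The formula for $\lambda(x_0)$ as a time average of $\log|f'|$ lends itself to a direct numerical estimate. The sketch below, with names and parameters of our choosing, applies it to the logistic map with $a = 3.3$, whose orbits are attracted by the 2-periodic sink, and compares the result with the exponent $\frac12 \log|f'(p_1) f'(p_2)|$ of the periodic orbit; for this map the product equals $-a^2 + 2a + 4 = -0.29$.

```python
import math


def lyapunov_estimate(f, fprime, x0, n):
    """(1/n) * sum_{i=1}^{n} log|f'(x_i)| with x_i = f^i(x0)."""
    x, total = x0, 0.0
    for _ in range(n):
        x = f(x)
        total += math.log(abs(fprime(x)))
    return total / n


a = 3.3
f = lambda x: a * x * (1.0 - x)          # logistic map
fp = lambda x: a * (1.0 - 2.0 * x)       # its derivative

lam = lyapunov_estimate(f, fp, 0.3, 20000)
lam_cycle = 0.5 * math.log(0.29)         # exponent of the 2-periodic orbit

if __name__ == "__main__":
    print(lam, lam_cycle)
```

The estimate is negative, as expected for an orbit attracted by a periodic sink. The same function can be tried with $a = 4$, where one expects a positive value; the sum breaks down if an iterate hits the critical point $x = 1/2$ exactly, a case this sketch does not handle.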



8.66 Proposition. Let $f : I \to I$ be of class $C^1$ and let $\{x_n\}$, $\{y_n\}$, $x_n := f^n(x)$, $y_n := f^n(y)$, be the orbits of $x$ and $y$ respectively. Suppose that

(i) $\{x_n\}$ and $\{y_n\}$ are asymptotic to each other, $|x_n - y_n| \to 0$ as $n \to \infty$,

(ii) $f'(x_n) \ne 0$, $f'(y_n) \ne 0$ $\forall n$,

(iii) $\log|f'(x_n)| \to \lambda \in \mathbb{R}$.

Then the Lyapunov exponents of $f$ at $x$ and $y$ exist and $\lambda(x) = \lambda(y) = \lambda$.

Proof. In fact, Cesàro's theorem (see Example 2.56) yields

$$\lambda(x) := \lim_{n \to \infty} \frac{1}{n} \sum_{i=1}^{n} \log|f'(f^i(x))| = \lim_{n \to \infty} \log|f'(f^n(x))| = \lambda.$$

Since the two orbits are asymptotic,

$$\lim_{n \to \infty} \log|f'(f^n(x))| = \lim_{n \to \infty} \log|f'(f^n(y))|,$$

hence

$$\lambda(y) := \lim_{n \to \infty} \frac{1}{n} \sum_{i=1}^{n} \log|f'(f^i(y))| = \lim_{n \to \infty} \log|f'(f^n(y))| = \lambda. \qquad □$$



b. Chaotic orbits 

8.67 Definition. Let $f : I \to I$ be a smooth map. We say that the orbit $\{x_1, x_2, \ldots\}$ of $f$ is chaotic if

(i) $\{x_1, x_2, \ldots\}$ is bounded,

(ii) $\{x_1, x_2, \ldots\}$ is not asymptotically periodic,

(iii) the Lyapunov exponent $\lambda(x_1)$ is positive.

The definition of a chaotic dynamical system is more complex and less unanimously agreed upon. Usually, the presence of bounded orbits with exponential separation is required, together with density to grant irreducibility, that is, to ensure that we are not in the presence of two assembled independent dynamical systems. We now illustrate a few examples.

c. Bernoulli’s shift 

Let us consider the process $x_{n+1} = \sigma(x_n)$ associated to the map

$$\sigma(x) = 2x \mod 1, \qquad x \in [0,1],$$

see Figure 8.23. In order to describe its action, it is convenient to work with numbers in $[0,1]$ in their binary representation. We write

$$x = \sum_{i=1}^{\infty} \frac{a_i}{2^i} = 0.a_1a_2a_3\ldots$$

where $a_i$ has the value 0 or 1. For $x < 1/2$, we have $a_1 = 0$ while $x > 1/2$ implies $a_1 = 1$. Therefore

$$\sigma(x) = \begin{cases} 2x & \text{if } a_1 = 0, \\ 2x - 1 & \text{if } a_1 = 1, \end{cases}$$

or

$$\sigma(0.a_1a_2a_3\ldots) = 0.a_2a_3a_4\ldots,$$

that is, the action of $\sigma$ on the binary representation of $x$ is to delete the first digit and shift the remaining sequence to the left.

The process $\sigma$ shows sensitive dependence on the initial condition. If two numbers differ starting only from the $n$-th digit, such a difference becomes amplified under the action of $\sigma^n$ by $2^n$: the $n$-th iterates differ in the first digit. More precisely, we compute the Lyapunov exponent at a point $x$ whose orbit never goes through $0, 1/2, 1$ as

$$\lambda(x) = \lim_{n \to \infty} \frac{1}{n} \sum_{i=1}^{n} \log|\sigma'(x_i)| = \lim_{n \to \infty} \frac{1}{n} \sum_{i=1}^{n} \log 2 = \log 2.$$

Figure 8.24. The triangular map $f_1(x)$ and its third iterate $f_1^3(x)$.

n—^oo ' n— >00 Tl 

i=l 2=1 

On the other hand periodic points are the numbers with a repeating binary representation, and the asymptotically periodic orbits are those with an initial point with a repeating binary representation starting from a suitable digit. Therefore an asymptotically periodic orbit starts at $x$ if and only if $x$ is rational. Hence the orbits starting from an irrational are bounded, are not asymptotically periodic and have Lyapunov exponent $\log 2 > 0$. So we conclude

8.68 Theorem. A point $x \in [0,1]$ has a chaotic orbit under Bernoulli's shift if and only if $x$ is irrational.
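The action of Bernoulli's shift on binary digits, and the eventual periodicity of the orbits of rational points, can be checked in exact rational arithmetic. The sketch below is ours; it deliberately avoids floating-point numbers, since every floating-point number is a dyadic rational and its orbit under $x \mapsto 2x \bmod 1$ reaches 0 after finitely many steps.

```python
from fractions import Fraction


def shift(x):
    """Bernoulli's shift sigma(x) = 2x mod 1, exact for rational x."""
    return (2 * x) % 1


def binary_digits(x, n):
    """First n binary digits a_1, ..., a_n of a rational x in [0, 1[."""
    digits = []
    for _ in range(n):
        x = 2 * x
        d = int(x >= 1)
        digits.append(d)
        x -= d
    return digits


def eventual_period(x, max_steps=1000):
    """Return (preperiod, period) of the orbit of a rational x, or None."""
    seen = {}
    for k in range(max_steps):
        if x in seen:
            return seen[x], k - seen[x]
        seen[x] = k
        x = shift(x)
    return None


if __name__ == "__main__":
    x = Fraction(1, 6)
    print(binary_digits(x, 8), binary_digits(shift(x), 7))
    print(eventual_period(x))
```

Applying `shift` drops the first digit of the expansion, and a rational starting point always yields a finite preperiod followed by a periodic part.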



One can also prove the following properties of Bernoulli’s shift $\sigma$:

(i) The sequence of the iterates $\sigma^n(x)$ has the same random properties as the successive tosses of a coin. In fact, a sequence of coin tossings is equivalent to the choice of a point in $[0,1]$.

(ii) One can show (see, e.g., G.H. Hardy, E.M. Wright, The theory of numbers, Oxford University Press, Oxford 1938) that almost all irrationals in $[0,1]$, in the sense of Lebesgue, contain in their binary representation any finite sequence of digits infinitely often and uniformly distributed, i.e.,

$$\frac{1}{N} \sum_{n=1}^{N} F(\sigma^n(x)) \to \int_0^1 F(t)\,dt \quad \text{as } N \to \infty.$$

Bernoulli’s shift contrasts with the additive process we discussed before,

$$x_{n+1} = x_n + \omega \mod 1, \qquad \omega \text{ irrational}, \qquad (8.42)$$

that we cannot qualify as chaotic. In fact, it has no periodic orbits and consequently no asymptotically periodic orbit, each orbit is dense, but the Lyapunov exponent of each orbit is zero.

Figure 8.25. The triangular map $f_1(x)$ and its third iterate $f_1^3(x)$.

8.69 Definition. A bounded orbit that is not asymptotically periodic and 
does not show sensitive dependence on the initial data is called almost- 
periodic. 

The orbits of (8.42) are almost periodic. 



d. The triangular map 

Consider the family of triangular maps

$$f_r(x) := \begin{cases} 2rx & \text{if } x \le 1/2, \\ 2r(1 - x) & \text{if } x > 1/2, \end{cases} \qquad x \in [0,1].$$

For $r < 1/2$, $x^* = 0$ is the only fixed point; it is stable and all points in $[0,1]$ are attracted to it. For $r > 1/2$ two unstable fixed points (sources) emerge, and the behaviors of the process for $1/2 < r < 1$ and $r = 1$ are similar. The map $f_1$ shows sensitive dependence on initial conditions. In fact, the $n$-th iterate of $f_1$ is piecewise linear, with slopes $\pm 2^n$ except at the points $j \cdot 2^{-n}$, $j = 0, 1, \ldots, 2^n$. Consequently the separation of “almost all points” $x_0$ grows exponentially with Lyapunov exponent $\lambda(x_0) = \log 2$. In general, the Lyapunov exponent of $f_r$ is, for almost all points $x_0$,

$$\lambda(x_0; f_r) = \log 2r,$$

and, for $r > 1/2$, we have $\lambda(x_0; f_r) > 0$, that is we “lose information” on the position of $x_0$ after $n$ iterations, while for $r < 1/2$, we have $\lambda(x_0; f_r) < 0$ and we “gain information” as the iterates of $x_0$ converge to 0.
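Since $|f_r'| = 2r$ away from $x = 1/2$, the value $\log 2r$ can be read off directly from the time average of $\log|f_r'|$ along any orbit avoiding $1/2$. A minimal sketch, with our own function names and with the one-sided slope used at $x = 1/2$:

```python
import math


def tent(r, x):
    """Triangular map f_r on [0, 1]."""
    return 2.0 * r * x if x <= 0.5 else 2.0 * r * (1.0 - x)


def tent_slope(r, x):
    """Slope of f_r at x (left derivative at x = 1/2, by convention here)."""
    return 2.0 * r if x <= 0.5 else -2.0 * r


def lyapunov(r, x0, n=1000):
    """Time average of log|f_r'| along the first n points of the orbit of x0."""
    x, total = x0, 0.0
    for _ in range(n):
        x = tent(r, x)
        total += math.log(abs(tent_slope(r, x)))
    return total / n


if __name__ == "__main__":
    for r in (0.4, 0.9):
        print(r, lyapunov(r, 0.3), math.log(2.0 * r))
```

For $r = 0.4$ the exponent is negative and the iterates decay to 0, while for $r = 0.9$ it is positive.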

On the other hand, one can show that the periodic orbits do not attract any orbit, inferring this way the following.

8.70 Proposition. The triangular map $f_1$ has infinitely many chaotic orbits.

e. Conjugate maps

8.71 Definition. Two continuous maps $f$ and $g$ are said to be conjugate if there is a continuous and 1-to-1 change of variable $C$ such that $C \circ f = g \circ C$.

8.72 Proposition. Let $f$ and $g$ be conjugate by $C$. If $x$ is $k$-periodic for $f$, then $C(x)$ is $k$-periodic for $g$. If moreover $f$, $g$ and $C$ are of class $C^1$ and $C'$ never vanishes along the $k$-periodic orbit of $x$, then

$$(g^k)'(C(x)) = (f^k)'(x).$$

8.73 Proposition. The triangular map $f_1$ and the logistic map $g(x) = 4x(1 - x)$ are conjugate.

8.74 ¶. Show Proposition 8.73. [Hint: For $x \in [0, 1/2]$ choose $C(x) := \frac{1 - \cos \pi x}{2}$.]
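Before proving the conjugacy, one may check the identity $C \circ f_1 = g \circ C$ numerically on a grid. The sketch below assumes the change of variable $C(x) = (1 - \cos \pi x)/2$ on the whole interval $[0,1]$; it is a sanity check, not a proof.

```python
import math


def tent(x):
    """Triangular map f_1 on [0, 1]."""
    return 2.0 * x if x <= 0.5 else 2.0 * (1.0 - x)


def logistic4(x):
    """Logistic map g(x) = 4 x (1 - x)."""
    return 4.0 * x * (1.0 - x)


def C(x):
    """Candidate change of variable C(x) = (1 - cos(pi x)) / 2."""
    return (1.0 - math.cos(math.pi * x)) / 2.0


def max_conjugacy_error(samples=1001):
    """Largest |C(f_1(x)) - g(C(x))| over an equally spaced grid in [0, 1]."""
    grid = [i / (samples - 1) for i in range(samples)]
    return max(abs(C(tent(x)) - logistic4(C(x))) for x in grid)


if __name__ == "__main__":
    print(max_conjugacy_error())
```

Both sides reduce to $\sin^2 \pi x$, which is why the error stays at the level of rounding.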

Taking into account Propositions 8.72 and 8.73 one could also show the 
following. 



8.75 Proposition. Let $g(x) := 4x(1 - x)$ be the logistic map.

(i) All periodic points of $g$ are sources.

(ii) $g$ has chaotic orbits.

8.76 ¶¶. Show that the process associated to the logistic map $g$ is ergodic and for all $F$

$$\lim_{N \to \infty} \frac{1}{N} \sum_{n=1}^{N} F(g^n(x)) = \int_0^1 \frac{F(t)}{\pi\sqrt{t - t^2}}\,dt.$$

8.77 ¶. Show that

(i) the maps $(a + 1)x - ax^2$ and $x^2 + c$, $c := \frac{1 - a^2}{4}$, are conjugate,

(ii) the maps $ax(1 - x)$ and $x^2 + c$, $c = \frac{a}{2}\big(1 - \frac{a}{2}\big)$, are conjugate.

[Hint: (i) $\varphi(x) := \frac{a + 1}{2} - ax$, (ii) $\varphi(x) := \frac{a}{2} - ax$.]



8.2.4 Chaotic attractors, basins of attraction 

Let us start with a few definitions.

The forward limit of $x$ is the set of points the orbit converges to, that is, the set of points to which the orbit with initial condition $x$ comes infinitely often arbitrarily close, i.e.,

$$\omega(x) := \Big\{ y \;\Big|\; \liminf_{n \to \infty} |f^n(x) - y| = 0 \Big\}.$$

The following is trivial:

(i) if $\{f^n(x)\}$ converges to $p$, then $\omega(x) = \{p\}$,

(ii) if $x$ is a fixed point, $\omega(x) = \{x\}$; actually, if $\omega(x)$ contains a stable fixed point $\bar x$, then $\omega(x) = \{\bar x\}$,

(iii) if $y \in \omega(x)$, then $f^n(y) \in \omega(x)$ $\forall n \ge 0$,

(iv) if $x$ is a $k$-periodic point, then $\omega(x)$ is the set of values of the periodic orbit originating from $x$.

When the forward limit is a set of fixed points or of periodic points, then it is often called the stable manifold.

Figure 8.26. (a) The map $f(x) = \frac{4}{\pi} \arctan x$. (b) The map in Example 8.80.

8.78 Definition. Let $\omega(x)$ be the forward limit of a point $x$. We say that $\omega(x)$ attracts $y$ if $\omega(y) \subset \omega(x)$. The basin of attraction of $\omega(x)$ is the set of points which are attracted by $\omega(x)$. We say that $\omega(x)$ is an attractor if it attracts a substantial number of points, more precisely if it attracts a set of points of positive Lebesgue measure.

We then say that $\omega(x)$ is a chaotic set if the orbit of $x$ is a chaotic orbit and $x \in \omega(x)$. Finally $\omega(x)$ is a chaotic attractor if $\omega(x)$ is both a chaotic set and an attractor.

8.79 Example. The map

$$f(x) = \frac{4}{\pi} \arctan x$$

has three fixed points $-1$, $0$ and $1$. From Figure 8.26 we see that $-1$, $1$ are sinks, while $0$ is a source. The attractors are therefore $\{-1\}$, $\{1\}$, and their basins of attraction are respectively the negative half-line and the positive half-line.

8.80 Example. Consider the map in polar coordinates

$$f(r, \theta) := (r^2, \theta - \sin\theta), \qquad r \ge 0, \ 0 \le \theta < 2\pi.$$

It has three fixed points, $(0,0)$, $(1,0)$ and $(1,\pi)$. The origin and infinity are two attractors: iterations move every interior point of the unit disk to the origin and every exterior point to infinity. On the circle the dynamics moves all points but $(1,\pi)$, that is a source, to $(1,0)$, see Figure 8.26.

Figure 8.27. (a) The triangular map with slope 3. (b) The map W.



8.81 Example (The triangular map with slope 3). The following example shows that the basin of attraction of an attractor can be quite a complex set. Let us consider the triangular map $f : \mathbb{R} \to \mathbb{R}$ in Figure 8.27 (a). Clearly, the orbits with initial conditions in $]-\infty, 0[$ or in $]1, \infty[$ converge to $-\infty$. Also orbits with initial conditions in $]1/3, 2/3[$ converge to $-\infty$ since the first iteration maps this interval into $]1, \infty[$. It should then be clear that the basin of attraction of $-\infty$ under the map $f$ is the complement in $\mathbb{R}$ of the Cantor middle-third set defined below.
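The escape mechanism can be followed step by step in exact arithmetic. The sketch below assumes that the map of Figure 8.27 (a) is $f(x) = 3x$ for $x \le 1/2$ and $f(x) = 3(1 - x)$ for $x > 1/2$; the figure is not reproduced here, so this explicit formula is our reading of "the triangular map with slope 3".

```python
from fractions import Fraction


def tent3(x):
    """Assumed slope-3 triangular map on the real line."""
    return 3 * x if x <= Fraction(1, 2) else 3 * (1 - x)


def escape_time(x, max_steps=50):
    """Number of iterations after which the orbit of x has left [0, 1].

    Returns None if the orbit is still in [0, 1] after max_steps iterations.
    """
    for k in range(max_steps + 1):
        if x < 0 or x > 1:
            return k
        x = tent3(x)
    return None


if __name__ == "__main__":
    for x in (Fraction(1, 2), Fraction(1, 5), Fraction(1, 4), Fraction(1, 10)):
        print(x, escape_time(x))
```

Points such as $1/4$ and $1/10$, whose ternary expansions use only the digits 0 and 2, never leave $[0,1]$, while $1/2$ and $1/5$ escape after one and two steps respectively. Once outside $[0,1]$, an orbit diverges to $-\infty$.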

8.82 Example. Let us consider the map $f(r, \theta) := (r^{1/2}, 2\theta)$. The origin is a source and $\omega(0) = \{0\}$. Taking into account that $(1, \theta) \mapsto (1, 2\theta)$ describes Bernoulli’s shift on $S^1$, we see that the unit circle is the forward limit set of an orbit with Lyapunov exponent $\log 2$ for almost all initial points in the circle, consequently the unit circle is a chaotic set. Finally all points but the origin are attracted to the unit circle; we therefore conclude that the unit circle is a chaotic attractor.



8.83 Example (The W map). Let us consider the piecewise linear map $f : [0,1] \to [0,1]$ in Figure 8.27 (b). Restricting our attention to the interval $[1/4, 3/4]$, the map acts as the triangular map with slope 2, while the points in $[0,1] \setminus [1/4, 3/4]$ are mapped by the first iteration into $[1/4, 3/4]$. We therefore conclude that $[1/4, 3/4]$ is a chaotic attractor.



8.84 Example (The baker’s map). Let us consider the area-preserving map given by

$$x_{n+1} = 2x_n \pmod 1, \qquad y_{n+1} = \begin{cases} \frac12 y_n & \text{if } 0 \le x_n < 1/2, \\ \frac12 + \frac12 y_n & \text{if } 1/2 \le x_n \le 1, \end{cases}$$

and illustrated in Figure 8.28. Since the first component is Bernoulli’s shift, we easily conclude that the entire square is a chaotic attractor.

Figure 8.28. The baker’s map.

Figure 8.29. The dissipative baker’s map.






8.85 Example (The baker’s dissipative process). Consider now the process, similar to the baker’s process,

$$x_{n+1} = 2x_n \pmod 1, \qquad y_{n+1} = \begin{cases} a y_n & \text{if } 0 \le x_n < 1/2, \\ \frac12 + a y_n & \text{if } 1/2 \le x_n \le 1, \end{cases}$$

where $a < 1/2$, see Figure 8.29. This process is dissipative, that is, it does not preserve the area. The baker’s dissipative process still has a chaotic attractor, which is now made of a huge set of horizontal lines. As we shall see below, compare Example 8.99, this strange attractor is a fractal, in the sense that its “dimension” is strictly between 1 and 2.



8.86 Example. A process that is very similar to the baker’s dissipative process is the one associated to the map

$$f(x, y) := \begin{cases} \big(\frac13 x,\ 2y\big) & \text{if } 0 \le y < 1/2, \\ \big(\frac13 x + \frac23,\ 2y - 1\big) & \text{if } 1/2 \le y \le 1. \end{cases} \qquad (8.43)$$

The first iteration maps the square into the first and last third of the square, as shown in Figure 8.30. The figure shows also the second iteration. Clearly the attractor of the full square is again a chaotic attractor which is a strange attractor, being a set of lines which has a “noninteger dimension.”



8.2.5 Cantor sets and other self-similar sets 

a. Measure and dimension 

8.87 Box counting dimension. A simple way to compute a “dimension” of a bounded subset $A$ of $\mathbb{R}^2$ is the following. Assume that $A$ is contained in a rectangle $R$. Then divide each side of $R$ in $2^k$ pieces, thus dividing $R$ into $4^k$ rectangles. Let $N(k)$ be the number of rectangles which touch $A$. If $A$ is a line, then for $k$ large $N(k)$ is of order $2^k$, while if $A$ has an interior part, then $N(k)$ is of order $4^k$, thus it is reasonable to set the box counting dimension of $A$ at scale $2^{-k}$ as

$$\frac{\log_2 N(k)}{k} = \frac{\log N(k)}{k \log 2}$$

and the box counting dimension as

$$\dim_B(A) := \lim_{k \to \infty} \frac{\log N(k)}{k \log 2} \qquad (8.44)$$

provided the limit exists.

Figure 8.30. The first two iterations of the process in Example 8.86.

Figure 8.31.
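The box counting recipe is straightforward to try on finite samples of a set. In the sketch below, which is ours, a box is counted when it contains at least one sample point; this approximates "touching $A$" only as long as the sample is finer than the boxes.

```python
import math


def box_count(points, k):
    """Number of boxes of side 2**-k of the unit square containing a sample point."""
    n = 2 ** k
    boxes = set()
    for x, y in points:
        i = min(int(x * n), n - 1)   # column index of the box containing (x, y)
        j = min(int(y * n), n - 1)   # row index
        boxes.add((i, j))
    return len(boxes)


def dimension_at_scale(points, k):
    """log_2 N(k) / k, the box counting dimension at scale 2**-k."""
    return math.log2(box_count(points, k)) / k


M = 64
segment = [(i / M, i / M) for i in range(M)]                    # samples of a diagonal
square = [(i / M, j / M) for i in range(M) for j in range(M)]   # samples of the full square

if __name__ == "__main__":
    print(dimension_at_scale(segment, 5), dimension_at_scale(square, 5))
```

At scale $2^{-5}$ the sampled diagonal meets $2^5$ boxes and the sampled square all $4^5$, giving the values 1 and 2.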

8.88 Hausdorff measure and Hausdorff dimension. Usually, dimension is associated to changes in measure under dilations: under a dilation in $\mathbb{R}^3$ of factor $\lambda$, points do not change, segment and curve lengths are multiplied by $\lambda$, square and surface areas are multiplied by $\lambda^2$, while volumes are multiplied by $\lambda^3$. So another possible definition of dimension goes through the definition of a measure that scales suitably under dilations.

There are many different measures in $\mathbb{R}^n$, $n \ge 1$, that are invariant under translations, yield the same measure for regular sets, and scale with a given power less than $n$ under dilations. Among the many possibilities, the Hausdorff measure, introduced in 1918 by Felix Hausdorff (1869-1942), is sufficient and suited to our purposes.

Let us describe the Hausdorff measure in $\mathbb{R}^n$, $n \ge 1$. It is usual to define for any nonnegative real number $s \ge 0$,

$$\omega_s := \frac{\pi^{s/2}}{\Gamma(1 + s/2)},$$

$\Gamma$ being the gamma function, since, as one can show, for integral values of $s$, $\omega_s$ is the volume of the unit ball in $\mathbb{R}^s$.



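8.89 Definition. Let $A \subset \mathbb{R}^n$. For any $s \ge 0$ and $\delta > 0$ we define

$$\mathcal{H}^s_\delta(A) := \inf\Big\{ \sum_k \omega_s \Big( \frac{\operatorname{diam} C_k}{2} \Big)^s \;\Big|\; \bigcup_k C_k \supset A,\ \{C_k\} \text{ are } n\text{-balls with } \operatorname{diam} C_k < \delta,\ C_k \subset \mathbb{R}^n \Big\}.$$

The spherical Hausdorff measure of $A$ is then defined by

$$\mathcal{H}^s(A) := \lim_{\delta \to 0^+} \mathcal{H}^s_\delta(A), \qquad \forall A \subset \mathbb{R}^n.$$

We notice that $\mathcal{H}^s(A)$ is well defined, since $\mathcal{H}^s_\delta(A)$ is monotone in $\delta$ (it increases as $\delta$ decreases); moreover we notice that the naive definition

$$\inf\Big\{ \sum_k \omega_s \Big( \frac{\operatorname{diam} C_k}{2} \Big)^s \;\Big|\; A \subset \bigcup_k C_k \Big\}$$

would not work, see Figure 8.31.

It is easily seen:

(i) $\mathcal{H}^s(\lambda A) = \lambda^s \mathcal{H}^s(A)$, $\lambda > 0$, where $\lambda A = \{\lambda x \mid x \in A\}$,

(ii) $\mathcal{H}^0(A) = \#A$ is the measure that counts the elements of $A$ in $\mathbb{R}^n$,

(iii) if $s > n$, then $\mathcal{H}^s(A) = 0$ $\forall A \subset \mathbb{R}^n$,

(iv) if $f : \mathbb{R}^n \to \mathbb{R}^n$ is Lipschitz-continuous, then $\mathcal{H}^s(f(A)) \le \operatorname{Lip}(f)^s\, \mathcal{H}^s(A)$,

(v) $\mathcal{H}^s$ is invariant under translations and rotations.

Finally, we notice that

$$\mathcal{H}^r_\delta(A) \le \frac{\omega_r}{\omega_s} \Big( \frac{\delta}{2} \Big)^{r-s} \mathcal{H}^s_\delta(A), \quad \text{if } 0 \le s < r,$$

that is, if $0 \le s < r$, then

(a) $\mathcal{H}^s(A) = +\infty$ if $\mathcal{H}^r(A) > 0$,

(b) $\mathcal{H}^r(A) = 0$ if $\mathcal{H}^s(A) < \infty$.

In particular $\mathcal{H}^s(A)$ may be positive and finite only for one value of $s$.

8.90 Definition. The Hausdorff dimension of $A \subset \mathbb{R}^n$ is then defined as

$$\dim_{\mathcal{H}} A := \inf\{ s \ge 0 \mid \mathcal{H}^s(A) = 0 \}.$$

From the previous remarks, one easily infers

$$\dim_{\mathcal{H}} A = \sup\{ s \ge 0 \mid \mathcal{H}^s(A) = +\infty \}$$

and that $\dim_{\mathcal{H}} A = s$ if $0 < \mathcal{H}^s(A) < \infty$.

8.91 Definition. Sets in $\mathbb{R}^n$ with nonintegral dimension are called fractals.

Figure 8.32. The sets $E_0$, $E_1$, $E_2$ and $E_3$ in the construction of the Cantor set $C_{1/3}$.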



b. Cantor sets 

Cantor sets are a family of subsets of $\mathbb{R}$ among which the Cantor middle-third set is a prototype. They are obtained as follows. Choose $\delta \in\, ]0, 1/2[$ ($\delta = 1/3$ for the Cantor middle-third set), and set $E_0 = [0,1]$. In the first step we define $E_1$ by removing from $E_0$ an open interval, centered at the middle point, of length $1 - 2\delta$. $E_1$ is then the union of two intervals of size $\delta$. By induction, we define $E_{k+1}$ by removing from each of the intervals of $E_k$ a centered open interval of length $\delta^k(1 - 2\delta)$. This way we get a decreasing sequence of sets $\{E_k\}$, $E_k$ being the union of $2^k$ intervals of size $\delta^k$. The Cantor set associated to $\delta$ is defined as

$$C_\delta := \bigcap_{k=0}^{\infty} E_k. \qquad (8.45)$$

It is not difficult to show the following.

8.92 Proposition. The Cantor middle-third set $C_{1/3}$, corresponding to $\delta = 1/3$, consists of all numbers in $[0,1]$ that have a ternary expansion involving only the digits 0 and 2.
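The inductive construction of the sets $E_k$ can be carried out exactly with rational endpoints. In this sketch, with names of our choosing, each closed interval is stored as a pair of `Fraction` endpoints.

```python
from fractions import Fraction


def cantor_step(intervals, delta):
    """From each closed interval [lo, hi] keep the two end pieces of relative size delta,
    i.e. remove the centered open interval of relative length 1 - 2*delta."""
    out = []
    for lo, hi in intervals:
        size = (hi - lo) * delta
        out.append((lo, lo + size))
        out.append((hi - size, hi))
    return out


def cantor_level(k, delta=Fraction(1, 3)):
    """The list of intervals whose union is E_k, ordered from left to right."""
    intervals = [(Fraction(0), Fraction(1))]
    for _ in range(k):
        intervals = cantor_step(intervals, delta)
    return intervals


if __name__ == "__main__":
    E3 = cantor_level(3)
    print(len(E3), sum(hi - lo for lo, hi in E3))
    print([lo for lo, _ in cantor_level(2)])
```

For $\delta = 1/3$ the set $E_k$ consists of $2^k$ intervals of length $3^{-k}$, with total length $(2/3)^k$ tending to 0, and the left endpoints have ternary expansions with digits 0 and 2 only, in accordance with Proposition 8.92.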

Another way to look at Cantor sets is useful. Consider the two maps, 
actually two contractive similitudes, 81,82 : [0, 1] — » [0, 1] given by 

8i{x) := 5x, 82 {x) := 5x 1 — 5, 

and for any set A C [0, 1], set 8 (A) := 8i{A) U 82(A). 

Then observe that the sets $\{E_k\}$ in (8.45) are actually produced by the
following dynamical system acting on sets,

$$\begin{cases} E_0 := [0,1], \\ E_{k+1} := S(E_k), \end{cases}$$

i.e.,

$$E_k := \underbrace{S \circ S \circ \cdots \circ S}_{k}([0,1]) =: S^k([0,1]),$$

from which, taking also into account that $E_{k+1} \subset E_k$ $\forall k$, we infer

$$C_\delta = \bigcap_{k=1}^{\infty} E_k = \bigcap_{k=0}^{\infty} S(E_k)
  = S\Big(\bigcap_{k=0}^{\infty} E_k\Big) = S(C_\delta), \tag{8.46}$$

where the third equality holds since $E_{k+1} \subset E_k$.
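The recursion $E_0 := [0,1]$, $E_{k+1} := S(E_k)$ is easy to carry out on lists of intervals. The following Python sketch is a minimal illustration, not part of the original treatment: the names `cantor_step` and `cantor_level` are ours, intervals are stored as pairs of endpoints, and exact `Fraction` arithmetic is assumed so that lengths can be compared without rounding.

```python
from fractions import Fraction


def cantor_step(intervals, delta):
    """Apply S(A) = S1(A) U S2(A), S1(x) = delta*x, S2(x) = delta*x + 1 - delta."""
    left = [(delta * a, delta * b) for a, b in intervals]
    right = [(delta * a + 1 - delta, delta * b + 1 - delta) for a, b in intervals]
    return left + right


def cantor_level(k, delta=Fraction(1, 3)):
    """Return E_k as a sorted list of closed intervals (a, b)."""
    intervals = [(Fraction(0), Fraction(1))]
    for _ in range(k):
        intervals = cantor_step(intervals, delta)
    return sorted(intervals)
```

With $\delta = 1/3$, `cantor_level(2)` gives the four intervals $[0,1/9]$, $[2/9,1/3]$, $[2/3,7/9]$, $[8/9,1]$, in agreement with the description of $E_k$ as a union of $2^k$ intervals of size $\delta^k$.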

8.93 Definition. A system $S := (S_1, S_2, \dots, S_N)$ of $N$ contraction maps
on $\mathbb{R}^n$ is called an iterated function system, an IFS for short. A set
$C \subset \mathbb{R}^n$ such that

$$C = \bigcup_{i=1}^{N} S_i(C), \qquad S_i(C) \cap S_j(C) = \emptyset \quad \forall i \ne j$$

is called a self-similar set.

8.94 Remark. The terminology becomes clearer when $(S_1, S_2, \dots, S_N)$
are contractive similitudes. Assuming for instance $N = 2$, and that $S_1$ and
$S_2$ contract by a factor $\delta$, conditions

$$C = S_1(C) \cup S_2(C), \qquad S_1(C) \cap S_2(C) = \emptyset \tag{8.47}$$

say that $C$ is the union of two pieces which are each a scaled down copy
of $C$ itself, that is, $C$ is self-similar at the scale $\delta$. Moreover
from (8.47) we also have

$$C = (S_1 \circ S_1(C)) \cup (S_1 \circ S_2(C)) \cup (S_2 \circ S_1(C)) \cup (S_2 \circ S_2(C)),$$

$$S_i \circ S_j(C) \cap S_h \circ S_k(C) = \emptyset \quad \forall (i,j) \ne (h,k),$$

that is, $C$ is also the union of four pieces that are scaled down versions of
$C$ of factor $\delta^2$. Proceeding by induction, for any $k$, $C$ is also the
union of $2^k$ pieces each of which is a scaled down copy of $C$ by a factor
$\delta^k$. This is self-similarity.



8.95 . A more explicit description of the Cantor set is the following one. If the base 
points of the Cantor set are defined by induction by 



00,1 = 0 , Ok+l,i = < u 



then 



are the intervals to be deleted from $E_{k-1}$ to get $E_k$ at the $k$-th step, and

$$J_{k,j} := b_{k,j} + \delta^k[0,1], \qquad j = 1, \dots, 2^k,$$

are the intervals whose union is $E_k$. We have set $\alpha[a,b] := [\alpha a, \alpha b]$. Consequently

$$C_\delta = \bigcap_{k=0}^{\infty} \Big( \bigcup_{j=1}^{2^k} J_{k,j} \Big).$$

This shows again that $C_\delta$ is self-similar, and more precisely that

$$b_{k,j} + \delta^k C_\delta = C_\delta \cap J_{k,j}. \tag{8.48}$$

In fact,

$$x \in J_{h,j} \ \text{ if and only if } \ \delta^k x \in J_{h+k,j}, \qquad \forall h, k, j,$$

hence

$$\delta^k E_h = E_{h+k} \cap [0, \delta^k], \quad \text{i.e.,} \quad b_{k,j} + \delta^k E_h = E_{h+k} \cap J_{k,j},$$

for all $h, k \ge 0$, $j = 1, \dots, 2^k$; and (8.48) follows by taking into account the intersection on $h$.

Figure 8.33. Helge von Koch (1870-1924) and Waclaw Sierpinski (1882-1969).



c. Iterated function systems 

Self-similarity is even more evident in the two-dimensional Cantor set,
known also as Sierpinski’s square, obtained by dividing the unit square into
$3^2$ squares, removing the central one, and repeating the procedure on each
of the squares left at the $n$-th step (see Figure 8.34); in the so-called
Sierpinski’s carpet, obtained by removing the central cross (see, for
example, Figure 8.37); or in Sierpinski’s gasket (see Figure 8.35).

Another “1-dimensional” example is the von Koch curve; compare Figure 6.23.
The curve of von Koch, contrary to regular curves and similarly to the
examples above, has the property that any enlargement or blow-up of it does
not simplify; instead it leaves the complex structure unchanged. This may be
presented as an informal definition of self-similarity.

Figure 8.34. The first steps in the construction of Sierpinski’s square.

Figure 8.35. Another step in the construction of Sierpinski’s square.

All these examples are defined following the same pattern. One starts
with an IFS of contractions $(S_1, \dots, S_N)$ on $\mathbb{R}^n$, and, as proved
by Felix Hausdorff (1869-1942), one can show (but we shall not do it here)
that there is a unique nonvoid, bounded and closed set $C$ such that

$$C = \bigcup_{i=1}^{N} S_i(C).$$

We refer to it as the invariant set of the IFS $(S_1, S_2, \dots, S_N)$. $C$ is
in fact a fixed point for the map $A \mapsto \bigcup_{i=1}^{N} S_i(A)$ on the
so-called Hausdorff space. Moreover, introducing a suitable notion of
“distance between sets,” $C$ can be found as the limit of the sequence of
sets $\{F_k\}$ defined by

$$\begin{cases} F_0 \ \text{an arbitrary closed bounded and nonvoid set}, \\ F_{k+1} := \bigcup_{i=1}^{N} S_i(F_k). \end{cases}$$

The reader will recognize the iterative procedure as the same one defining
Cantor sets in $\mathbb{R}$.
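The iteration $F_{k+1} := \bigcup_i S_i(F_k)$ can be tried out on finite sets of points, which are closed, bounded and nonvoid. The following Python sketch is only an illustration under stated assumptions: the function names are ours, all maps are assumed to have the form $p \mapsto \rho\, p + t_i$ with a common ratio $\rho$, and the three shifts $(0,0)$, $(1/2,0)$, $(0,1/2)$ with $\rho = 1/2$ are chosen as an example (they produce a gasket-like set in the triangle of vertices $(0,0)$, $(1,0)$, $(0,1)$).

```python
from fractions import Fraction

HALF = Fraction(1, 2)
# Assumed example: three half-scale maps p -> p/2 + t_i.
GASKET_SHIFTS = [
    (Fraction(0), Fraction(0)),
    (HALF, Fraction(0)),
    (Fraction(0), HALF),
]


def ifs_step(points, ratio, shifts):
    """Return the union of S_i(points) for S_i(p) = ratio * p + shift_i."""
    return {(ratio * x + a, ratio * y + b) for (x, y) in points for (a, b) in shifts}


def iterate_ifs(start, ratio, shifts, steps):
    """Return F_steps, starting from the finite set F_0 = start."""
    points = set(start)
    for _ in range(steps):
        points = ifs_step(points, ratio, shifts)
    return points
```

Starting from the single point $(0,0)$, four steps yield $3^4 = 81$ distinct points, all lying in the triangle; plotting them for larger `steps` gives an approximate picture of the invariant set.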

In general the invariant set is not self-similar, but it is under suitable 
sufficient conditions. 



d. Dimension of the invariant set 

We restrict ourselves to an IFS $S_1, S_2, \dots, S_N$ of contractive similitudes,

$$|S_i(x) - S_i(y)| = L_i\, |x - y| \qquad \forall x, y \in \mathbb{R}^n,\ i = 1, \dots, N,$$

with $0 < L_i < 1$.












Figure 8.37. The first steps in the construction of Sierpinski’s carpet. 



We define the geometric dimension of that IFS as the unique nonnegative
real number $D$ such that

$$\sum_{i=1}^{N} L_i^D = 1. \tag{8.49}$$

In general the geometric dimension has no relation with the more geometric 
definitions of dimension. However, we have the following. 

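8.96 Proposition. If the invariant set $C$ of an IFS of contractive
similitudes is self-similar and for some $s$ we have
$0 < \mathcal{H}^s(C) < +\infty$, then $s$ is the geometric dimension of the IFS.

Proof. In fact, since the pieces $S_i(C)$ are pairwise disjoint and
$\mathcal{H}^s(S_i(C)) = L_i^s\, \mathcal{H}^s(C)$, we have

$$\mathcal{H}^s(C) = \sum_{i=1}^{N} \mathcal{H}^s(S_i(C)) = \mathcal{H}^s(C) \sum_{i=1}^{N} L_i^s,$$

i.e., $\sum_{i=1}^{N} L_i^s = 1$, since $0 < \mathcal{H}^s(C) < \infty$. □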

Of course, going in the opposite direction is more useful. We state 
without proof the following theorem which gives a full description of some 
IFSs of contractive similitudes. 

8.97 Definition. One says that an IFS of contractive similitudes satisfies
the open set condition if there exists a nonempty open set $\Omega$ such that

$$\bigcup_{i=1}^{N} S_i(\Omega) \subset \Omega, \qquad S_i(\Omega) \cap S_j(\Omega) = \emptyset \quad \forall i \ne j.$$

Figure 8.38. Three iterates going to von Koch’s curve starting with the segment of
vertices (0,0) and (1,0).

Figure 8.39. The second, third, and fifth iterate going to von Koch’s curve starting with
the segment of vertices (0,1) and (1,0).



8.98 Theorem. Let $(S_1, S_2, \dots, S_N)$ be an IFS of contractive similitudes
which contract respectively by factors $L_1, \dots, L_N$, let $C$ be the
invariant set of the IFS, and let $d$ be the geometric dimension of the IFS,
$\sum_{i=1}^{N} L_i^d = 1$.

If the IFS satisfies the open set condition, then

(i) $0 < \mathcal{H}^d(C) < +\infty$, hence $\dim_{\mathcal{H}}(C) = d$,

(ii) $\mathcal{H}^d(S_i(C) \cap S_j(C)) = 0$ $\forall i \ne j$.

Moreover, each piece of $C$ has the same dimension, and $d$ is also the
box-counting dimension of $C$.

Theorem 8.98 yields a way to conclude that the invariant set of an IFS
of contractive similitudes that satisfies the open set condition is essentially
self-similar, since $C$ is a union of $N$ scaled down copies of $C$ itself that
can overlap but in a nonessential way, as the intersections have zero measure.
As a by-product we have a formula, the geometric dimension, to compute
effectively the dimension of $C$.
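When the contraction factors are not all equal, the equation $\sum_i L_i^D = 1$ has in general no closed-form solution, but it is easily solved numerically since the left-hand side is strictly decreasing in $D$. The following Python sketch uses plain bisection; the function name `geometric_dimension` and the tolerance are our own choices, not part of the text.

```python
def geometric_dimension(ratios, tol=1e-12):
    """Solve sum(L**D for L in ratios) == 1 for D >= 0 by bisection."""
    if not ratios or any(not 0 < L < 1 for L in ratios):
        raise ValueError("ratios must be nonempty with 0 < L < 1")

    def excess(d):
        return sum(L ** d for L in ratios) - 1.0

    lo, hi = 0.0, 1.0
    while excess(hi) > 0:      # enlarge the bracket until the sum drops below 1
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if excess(mid) > 0:    # sum still too large: the root lies to the right
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

For two maps of ratio $1/3$ the result agrees with $\log 2/\log 3$, and for four maps of ratio $1/4$ it is $1$. For one map of ratio $1/2$ together with eight maps of ratio $1/8$ the computed value is approximately $1.26$.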



8.99 Example (Cantor set in $\mathbb{R}$). As we have seen, this is the invariant set of the
two contractive similitudes of $\mathbb{R}$,

$$S_1(x) = \delta x, \qquad S_2(x) = \delta x + 1 - \delta.$$

$(S_1, S_2)$ satisfies the open set condition, $\Omega$ being the open interval $]0,1[$. Therefore the
Cantor set is self-similar and by Theorem 8.98 has dimension $d$ given by

$$2\delta^d = 1, \quad \text{i.e.,} \quad d = d(\delta) = \frac{\log 2}{\log(1/\delta)}.$$

Notice that $0 < d(\delta) < 1$ since $0 < \delta < 1/2$. See Figure 8.32 for the first iterations
starting from the segment $[0,1]$.



See, e.g., J. E. Hutchinson, Fractals and self-similarity, Indiana Univ. Math. J. 30,
713-747 (1981).

Figure 8.40. The first three iterates going to Sierpinski’s gasket starting from the triangle
of vertices (0,0), (1,0) and (0,1).



8.100 Example (Sierpinski’s gasket). This is the invariant set for the IFS of
contractive similitudes of $\mathbb{R}^2$ defined by

$$S_1\binom{x}{y} = \binom{x/2}{y/2}, \qquad
  S_2\binom{x}{y} = \binom{x/2}{y/2} + \binom{1/2}{0}, \qquad
  S_3\binom{x}{y} = \binom{x/2}{y/2} + \binom{0}{1/2}.$$

$(S_1, S_2, S_3)$ satisfies the open set condition, $\Omega$ being the open triangle of vertices (0,0),
(0,1) and (1,0). By Theorem 8.98 Sierpinski’s gasket is essentially self-similar and has
a nonintegral dimension $d$ given by

$$3\Big(\frac{1}{2}\Big)^d = 1, \quad \text{i.e.,} \quad d = \frac{\log 3}{\log 2} > 1.$$

See Figure 8.40 for the first iterations starting from the triangle $E_0$ of vertices (0,0),
(0,1) and (1,0).



8.101 Example (Sierpinski’s square). This is the invariant set for the IFS of eight
contractive similitudes of $\mathbb{R}^2$ defined by

$$S_i\binom{x}{y} = \frac{1}{3}\binom{x}{y} + \binom{a_i}{b_i}, \qquad i = 1, \dots, 8,$$

where $(a_i, b_i)$ is one of (0,0), (1/3,0), (2/3,0), (0,2/3), (1/3,2/3), (2/3,2/3), (0,1/3),
(2/3,1/3). $(S_1, S_2, \dots, S_8)$ satisfies the open set condition, $\Omega$ being the open square of
vertices (0,0), (0,1), (1,0) and (1,1). By Theorem 8.98 Sierpinski’s square is essentially
self-similar and has a nonintegral dimension $d$ given by

$$8\Big(\frac{1}{3}\Big)^d = 1, \quad \text{i.e.,} \quad d = \frac{\log 8}{\log 3}.$$

Notice that $1 < d < 2$. See Figure 8.41 for the first iterations starting from the square
$E_0$ of vertices (0,0), (0,1), (1,1) and (1,0).

Figure 8.41. The first three iterates going to Sierpinski’s square starting from the square
of vertices (0,0), (1,0), (1,1) and (0,1).

Figure 8.42. The first three iterates going to a Sierpinski’s carpet that contracts to 1/4
starting from the square of vertices (0,0), (1,0), (1,1) and (0,1).



8.102 Example (Sierpinski’s carpet). This is the invariant set for the IFS of four
contractive similitudes of $\mathbb{R}^2$ defined by

$$S_i\binom{x}{y} = q\binom{x}{y} + \binom{a_i}{b_i}, \qquad i = 1, \dots, 4,$$

where $q < 1/2$ and $(a_i, b_i)$ is one of $(0,0)$, $(1-q, 0)$, $(0, 1-q)$ and $(1-q, 1-q)$.
$(S_1, S_2, \dots, S_4)$ satisfies the open set condition, $\Omega$ being the open square of vertices
(0,0), (0,1), (1,0) and (1,1). By Theorem 8.98 Sierpinski’s carpet is essentially
self-similar and has dimension $d$ given by

$$4q^d = 1, \quad \text{i.e.,} \quad d = \frac{\log(1/4)}{\log q}.$$

Notice that for $q = 1/4$, we get a set of dimension 1. See Figure 8.42 for the first
iterations starting from the square $E_0$ of vertices (0,0), (0,1), (1,1) and (1,0).



8.103 Example (Snowflake). This is the invariant set for the IFS of nine contractive
similitudes of $\mathbb{R}^2$ defined by

$$S_1\binom{x}{y} = \frac{1}{2}\binom{x}{y} + \binom{1/4}{1/4},$$

and for $i = 2, \dots, 9$,

$$S_i\binom{x}{y} = \frac{1}{8}\binom{x}{y} + \binom{a_i}{b_i},$$

where $(a_i, b_i)$ is one of (0,0), (0,7/8), (7/8,0), (7/8,7/8), (1/8,1/8), (1/8,3/4), (3/4,1/8),
(3/4,3/4). $(S_1, S_2, \dots, S_9)$ satisfies the open set condition, $\Omega$ being the open square of
vertices (0,0), (0,1), (1,0) and (1,1). By Theorem 8.98 the snowflake is essentially
self-similar and has dimension $d$ given by

$$\Big(\frac{1}{2}\Big)^d + 8\Big(\frac{1}{8}\Big)^d = 1, \quad \text{i.e.,} \quad d = 1.25996\ldots.$$

See Figure 8.43 for the first iterations starting from the square $E_0$ of vertices (0,0),
(0,1), (1,1) and (1,0).



8.104 Example (von Koch’s curve). This is the invariant set for the IFS of
contractive similitudes of $\mathbb{R}^2$ defined by




where $R(\theta)$ denotes the rotation matrix by an angle $\theta$ measured counterclockwise,

$$R(\theta) = \begin{pmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{pmatrix}.$$

$(S_1, S_2, S_3, S_4)$ satisfies the open set condition, $\Omega$ being the open triangle of vertices
(0,0), (0,1) and $(1/2, \sqrt{3}/2)$. By Theorem 8.98 the von Koch curve is essentially
self-similar and has a nonintegral dimension $d$ given by

$$4\Big(\frac{1}{3}\Big)^d = 1, \quad \text{i.e.,} \quad d = \frac{\log 4}{\log 3} > 1.$$

See Figure 8.38 for the first iterations starting from the segment $[0,1]$ on the real axis.




Figure 8.43. The first three iterates going to the snowflake starting from the square of
vertices (0,0), (1,0), (1,1) and (0,1).

Figure 8.44. Some stable configurations: beehive, snake, long boat, longship.

8.3 Two-Dimensional Dynamical Systems

Of course, we have no chance here to discuss multidimensional discrete
processes. We confine ourselves to commenting on two quite popular processes:
the game of life and the dynamics of complex maps

$$P_c(z) := z^2 + c. \tag{8.50}$$



8.3.1 Game of life 

The game of life is a well-known dynamical system that was conceived in 
the 1960s by John Conway in Cambridge and has attracted many people, 
especially biologists. A comprehensive presentation of it can be found in 
E. R. Berlekamp, J. H. Conway, and R. K. Guy, Winning Ways, New York,
1982. Therefore, here we confine ourselves to saying a few words about it. 

Figure 8.45. Two 2-periodic states: blinker and beacon.

Figure 8.46. The Julia sets of $K_0$, $K_{-1}$, $K_{-2}$.

Imagine that the plane is decomposed into square cells, and that each
of these cells can be left vacant or can be filled with a black disc. A state
of our system is a distribution of black discs into a finite number of cells.
The denumerable family of all states is denoted by $X$. The transition law
from $x$ to $T(x)$ is defined by applying the following three rules to $x$:

(i) two or three neighbors keep you alive. A cell that is occupied in the
state $x$ will be occupied in $T(x)$ if and only if it has two or three
neighbors that are occupied in the state $x$.

(ii) three neighbors create life. A cell that is vacant in the state $x$ will
be occupied in the state $T(x)$ if and only if it has precisely three
neighbors that are occupied in $x$.

(iii) you die if you are alone or in the crowd. If neither (i) nor (ii) apply
to a given cell, this cell will be vacant in the state $T(x)$.
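The three rules translate directly into a few lines of code. The following Python sketch is an illustration under our own assumptions: a state is stored as a finite set of occupied cells $(x, y)$ with integer coordinates, the name `life_step` is hypothetical, and the neighbors of a cell are taken to be the eight cells surrounding it, which is the usual convention for this game.

```python
from collections import Counter


def life_step(cells):
    """Apply the transition law T to a finite set of occupied cells (x, y)."""
    # Count, for every cell, how many of its eight neighbors are occupied.
    neighbour_counts = Counter(
        (x + dx, y + dy)
        for (x, y) in cells
        for dx in (-1, 0, 1)
        for dy in (-1, 0, 1)
        if (dx, dy) != (0, 0)
    )
    # Rule (ii): exactly three neighbors create life.
    # Rule (i): an occupied cell survives with two or three neighbors.
    # Rule (iii): every other cell is vacant in T(x).
    return {
        cell
        for cell, n in neighbour_counts.items()
        if n == 3 or (n == 2 and cell in cells)
    }
```

For instance, three occupied cells in a row, `{(0, 0), (1, 0), (2, 0)}`, are mapped to three cells in a column and back again after a second step, so this state is 2-periodic, while a $2 \times 2$ block of occupied cells is a fixed point of $T$.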

Figure 8.44 shows some of the fixed points of T, while Figure 8.45 shows 
two 2-periodic states. 

But the game of life has a very rich dynamics. For instance, one can 
show: 

(i) the game of life can simulate any computer, 

(ii) there exists at least one garden of Eden, i.e., a configuration that has
no predecessor.




8.3.2 Fractal boundaries 

Let us now discuss very briefly some of the dynamics of the maps (8.50).

As we saw, when restricted to the real case, they are conjugate to the
logistic maps. The complexity of the maps in (8.50) in $\mathbb{C}$ therefore may
better motivate the complexity of the logistic map.

Let us begin with the case $c = 0$. The map has a sink at $z = 0$ with
basin of attraction the unit disc $\{z \mid |z| < 1\}$. Points of
$S^1 := \{z \mid |z| = 1\}$ are mapped into $S^1$ with double argument, while
the orbit of any exterior point $z$, $|z| > 1$, diverges to infinity.

The case $c \ne 0$ is much more complicated. The study of the iterates
of complex maps begins with the works of Pierre Fatou (1878-1929) and
Gaston Julia (1893-1978). With reference to the maps $P_c(z) = z^2 + c$, a
natural question to ask is: which points in $\mathbb{C}$ have unbounded orbits?

Figure 8.47. The Julia sets of $K_i$ and $K_{0.5+0.5i}$.



a. Julia sets 

8.105 Definition. We denote by $K_c$ the set of points in $\mathbb{C}$ which have
bounded orbits,

$$K_c := \{\, z \mid P_c^n(z) \not\to \infty \,\}.$$

The boundary $J_c$ of $K_c$ is called a Julia set.



8.106 ¶. As we have seen, $K_0 = \{z \mid |z| \le 1\}$. Show then that $K_{-2} = [-2, 2]$.

The Julia sets corresponding to c = 0,-2 are the only ones that are 
geometrically simple; for all other values of c the corresponding Julia sets 
are fractal. 

The following theorems, that we state without proof, are due to Fatou 
and Julia. The first shows that the dynamics is fairly controlled by critical 
points. 

8.107 Theorem (Fatou). Every attracting cycle of a polynomial map P 
attracts at least one critical point. 

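For instance, a quadratic polynomial has infinitely many periodic cycles,
however at most one may be attractive, as there is only one critical point
to be attracted. The map $P_{-i}(z) := z^2 - i$ has a repelling period 2
point, as $P_{-i}(i) = -1 - i$ and $P_{-i}(-1 - i) = i$; since the orbit of
the critical point $0$, namely $0, -i, -1-i, i, -1-i, \dots$, lands on this
repelling cycle, $P_{-i}$ has no attractor.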

8.108 Theorem (Julia). Let $K_c := \{\, z \mid P_c^n(z) \not\to \infty \,\}$. Then

(i) $K_c$ is connected if and only if the origin belongs to $K_c$,

(ii) $K_c$ is a Cantor type set if the origin does not belong to $K_c$.

A point $z$ is in the boundary of $K_c$ if in every ball centered at $z$ there are both points
of $K_c$ and of $\mathbb{C} \setminus K_c$.

We recall that a set in $\mathbb{C}$ is connected if any two of its points can be joined by a
continuous curve that is in the set.

Figure 8.48. (An approximation of) Mandelbrot’s set.



b. Mandelbrot set 

In consideration of the relevance of the orbit of the origin, 

8.109 Definition. We define the Mandelbrot set as

$$M := \{\, c \in \mathbb{C} \mid 0 \ \text{is not in the basin of attraction of } \infty \text{ for } P_c \,\}.$$

8.110 Show that 0, —i € M, while — 1 ^ M. 

It is worth noticing that Julia sets for Pc are never empty and that the 
Mandelbrot set is extremely complex, as it is connected and has a fractal 
boundary. 



8.3.3 Fractals on the computer 

Julia and Mandelbrot sets exert a tremendous esthetic fascination when 
represented on the screen of a computer, and, probably for this reason, 
they have become very popular. 



8.111 The sets $K_c$. Given $c$, we can visualize (approximately) the orbits
of each $z$ as follows. We choose a grid of points in the plane and
compute a fixed number $\bar{k}$ (say 100, 300 or 1000) of iterates of each point
of the grid. If the iterates $P_c^k(z)$, $k \le \bar{k}$, remain bounded,
$|P_c^k(z)| \le |c| + 1$, we colour $z$ in black; if the orbit becomes unbounded,
$|P_c^k(z)| > |c| + 1$ for some $k$, we colour $z$ in white. Notice, in fact, that
if $|P_c^k(z)| > |c| + 1$ for some $k$, then $|P_c^k(z)| \to \infty$ as $k \to \infty$.

Figure 8.49. A picture of the Mandelbrot set by a computer program.



8.112 Mandelbrot set. In order to visualize the Mandelbrot set, we 
proceed similarly. But now the points in the grid refer to c and the initial 
value of the orbit is always zero. 
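The two grid procedures just described can be sketched in a few lines of Python. This is only a minimal illustration under our own assumptions: the function names are hypothetical, the escape threshold is $|c| + 1$, the starting point itself is tested as well, and the picture is rendered as rows of characters (`#` for black, `.` for white) instead of pixels.

```python
def stays_bounded(z, c, max_iter=300):
    """Approximate test for z in K_c: no iterate leaves the disc of radius |c| + 1."""
    bound = abs(c) + 1.0
    for _ in range(max_iter):
        if abs(z) > bound:     # from here on the orbit diverges to infinity
            return False
        z = z * z + c
    return abs(z) <= bound


def in_mandelbrot(c, max_iter=300):
    """Approximate test for c in M: follow the orbit of 0 under P_c."""
    return stays_bounded(0j, c, max_iter)


def ascii_grid(test, n=41, half_width=2.0):
    """Evaluate test on an n-by-n grid covering [-half_width, half_width]^2."""
    step = 2.0 * half_width / (n - 1)
    rows = []
    for j in range(n):
        y = half_width - j * step
        rows.append("".join(
            "#" if test(complex(-half_width + i * step, y)) else "."
            for i in range(n)))
    return rows
```

For example, `print("\n".join(ascii_grid(lambda z: stays_bounded(z, -1))))` draws a coarse picture of $K_{-1}$, and `print("\n".join(ascii_grid(in_mandelbrot)))` one of the Mandelbrot set. A point coloured black is only known to stay bounded for `max_iter` steps, so the pictures are approximations from outside.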

8.113 Julia and Mandelbrot sets in colour. If $c \notin M$, then
$P_c^n(0) \to \infty$ as $n \to \infty$. We colour the point $c$ in the grid
according to the number of iterates needed to leave a disk of prescribed
radius $R$. We can proceed similarly for Julia sets.
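The colouring rule amounts to recording the first index at which the orbit of 0 leaves the disk. The following Python sketch is a minimal illustration: the name `escape_time`, the default radius $R = 2$ and the cap `max_iter` are our own assumptions, and the mapping from the returned integer to an actual colour is left to the reader.

```python
def escape_time(c, radius=2.0, max_iter=200):
    """Index of the first iterate of P_c, starting at 0, with modulus > radius.

    Returns None if the orbit stays in the disk for max_iter iterates.
    """
    z = 0j
    for n in range(1, max_iter + 1):
        z = z * z + c
        if abs(z) > radius:
            return n
    return None
```

For $c = 1$ the orbit of 0 is $1, 2, 5, \dots$, so the disk of radius 2 is left at the third iterate, while for $c = 0$ and $c = -1$ the orbit never leaves it and the function returns `None`. For Julia sets one proceeds in the same way with $c$ fixed and the grid point as the initial value.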



8.4 Exercises 



8.114 Show that 



\j6 + y/Q + VQ + ‘- = 3, + + = 1 + 

1 + 

1 + 






8.115 Discuss the recurrence 



f (T0 = O' > 0, 

Xn+I = n > 0. 



8.116 ¶¶ Campanato’s lemma. Let $\phi : ]0,1] \to \mathbb{R}$ be an increasing function such that

4>{r) < all 0 < r, < 1,

where A, B, a, (3 are nonnegative constants and 0 < a < (3. Show that 

0(r) < Cr^ , 

C being a constant depending only on a,/3,(l){R). 

8.117 % A useful lemma. Let (j) :]0, 1] — > M be an increasing function such that 

</)(r) < A{R B + for all 0 < r, < 1, 

where A, B, a, 6 are nonnegative constants, 0 < a and 0 < 0 < 1. Show that 

C being a constant depending only on a, 9. [Hint: Apply the assumption to r = rn, 
R = Tn+i, {pn} being a suitable increasing sequence that converges geometrically to 
i?.] 

8.118 Many identities involving Fibonacci numbers {fn} are known. The following 
exercises list some of them. Show the following. 

(i) $\sum_{i=1}^{n} f_i = f_{n+2} - 1$.

(ii) $\sum_{i=1}^{n} f_i^2 = f_n f_{n+1}$.

(iii) Cassini identity. $f_{n-1} f_{n+1} - f_n^2 = (-1)^n$.

(iv) E ”=0 (70 = fn- 

(v) Cesaro. $\sum_{i=0}^{n} \binom{n}{i} f_i = f_{2n}$.

(vi) Lucas. $\text{g.c.d.}(f_p, f_q) = f_{\text{g.c.d.}(p,q)}$.

(vii) For all n > 1, the numbers and /^i - fn-l Fibonacci numbers. 

(Viii) + /In- 

(ix) $f_{n+1}/f_n \to \tau := (1 + \sqrt{5})/2$.

(x) $\tau^n = \tau f_n + f_{n-1}$ where $\tau = (1 + \sqrt{5})/2$.

(xii) ^ (1 + ^)/2- 

(xiii) E7i ^ = 4 - T, r := (1 + VS)/2. 
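Before attempting a proof, identities of this kind can be checked numerically for small indices. The following Python sketch is only a sanity check and not a proof. It assumes the convention $f_0 = 0$, $f_1 = 1$, uses function names of our own, and tests five classical identities: $\sum_{i=1}^{n} f_i = f_{n+2} - 1$, $\sum_{i=1}^{n} f_i^2 = f_n f_{n+1}$, Cassini's identity $f_{n-1} f_{n+1} - f_n^2 = (-1)^n$, Cesaro's identity $\sum_{i=0}^{n} \binom{n}{i} f_i = f_{2n}$, and Lucas' identity $\gcd(f_p, f_q) = f_{\gcd(p,q)}$.

```python
from math import comb, gcd


def fib(n):
    """Fibonacci numbers with the (assumed) convention f_0 = 0, f_1 = 1."""
    a, b = 0, 1
    for _ in range(n):
        a, b = b, a + b
    return a


def check_identities(n_max=25):
    """Raise AssertionError if one of the identities fails for some n <= n_max."""
    for n in range(1, n_max + 1):
        assert sum(fib(i) for i in range(1, n + 1)) == fib(n + 2) - 1
        assert sum(fib(i) ** 2 for i in range(1, n + 1)) == fib(n) * fib(n + 1)
        assert fib(n - 1) * fib(n + 1) - fib(n) ** 2 == (-1) ** n
        assert sum(comb(n, i) * fib(i) for i in range(n + 1)) == fib(2 * n)
        for q in range(1, n_max + 1):
            assert gcd(fib(n), fib(q)) == fib(gcd(n, q))
    return True
```

Calling `check_identities()` runs through all indices up to 25 with exact integer arithmetic.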

Jj 

8.119 Let $\{x_n\}$ be the Heaviside sequence, $x_n = 1$ $\forall n$. Show that
$Z\{x\}(z) = z/(z-1)$.

8.120 ¶. Let $\{x_n\}$ be the linear increasing sequence, $x_n = an$ $\forall n$. Show that
$Z\{x\}(z) = az/(z-1)^2$.

8.121 Suppose that the Z-transform of a sequence $a = \{a_n\}$ is a rational function
near infinity, $Z\{a\}(z) = A(z)/B(z)$, $A(z)$, $B(z)$ being polynomials. Find the sequence
$a = \{a_n\}$ in terms of $A$ and $B$. [Hint: Use the Hermite decomposition formula.]

8.122 Let $\{x_n\}$ be the impulse sequence

$$x_n := (\underbrace{0, \dots, 0}_{h}, \underbrace{1, \dots, 1}_{k}, 0, \dots).$$

Show that $Z\{x\}(z) = \sum_{j=h}^{h+k-1} z^{-j}$.







8.123 Let {xn} be a sequence and let k > 1. Define the sequence m := {rrin} to be 
the sequence of traveling /c-means by 

mo := XQ, 

xq + xi 

mi := , 



and for any n > k, 

mu : 

Compute Z{m}{z). 

8.124 Let $\{x_n\}$ be the orbit of a dynamical system governed by a second order
difference equation. Show that the orbits are asymptotically stable if both the roots
$\lambda_1, \lambda_2$ of the characteristic equation satisfy $|\lambda_1|, |\lambda_2| < 1$. Show that the system is stable
if and only if either $|\lambda_1|, |\lambda_2| < 1$ or $|\lambda_1| = |\lambda_2| = 1$ with $\lambda_1 \ne \lambda_2$.

8.125 ¶¶. Let $R$ be a rectangle of sides 1 and $h < 1$. From $R$ we cut a square of side
$h$ and we are left with a rectangle of sides $h$ and $1 - h$. Then we reiterate the procedure.
Under what conditions on $h$ will the process never end?

8.126 Assuming that for $|z| < 1$,

$$\frac{e^z}{1 - z} = a_0 + a_1 z + a_2 z^2 + \cdots,$$

show that $a_n = \sum_{k=0}^{n} \frac{1}{k!}$. [Hint: Notice that $a_n - a_{n-1} = 1/n!$.]

8.127 Assuming that a solution of

$$\begin{cases} (6x^2 - 5x + 1)y'' + 2(12x - 5)y' + 12y = 0, \\ y(0) = 1, \\ y'(0) = 0 \end{cases}$$

can be written as $\sum_{n=0}^{\infty} a_n x^n$, find the $a_n$. [Hint: Show that $a_{n+2} - 5a_{n+1} + 6a_n = 0$
$\forall n$.]

8.131 Show that the fractional part of $((1 + \sqrt{5})/2)^n$ is not equidistributed,

$$\frac{\#\{\, 1 \le n \le N \mid \text{the fractional part of } ((1+\sqrt{5})/2)^n \text{ lies in } [1/4, 3/4] \,\}}{N} \to 0.$$

[Hint: Use that $u_n := ((1 + \sqrt{5})/2)^n + ((1 - \sqrt{5})/2)^n$ is an integer.]